(19) 



J 



Europaisches Patentamt 
European Patent Office 
Office euro pee n des brevets 



(12) 



(ID EP 0 612 348 B1 

EUROPEAN PATENT SPECIFICATION 



(45) Date of publication and mention 
of the grant of the patent: 
23.04.2003 Bulletin 2003/17 

(21) Application number: 92924237.8 

(22) Dateof filing: 02.11.1992 



(51) lntCl7: C12N 15/12, C12P 21/02, 
A61K 38/17, C12N 5/12, 
C07K 14/51 

(86) International application number: 
PCT/US92/09430 

(87) International publication number: 

WO 93/009229 (13.05.1993 Gazette 1993/12) 



(54) RECOMBINANT BONE MORPHOGENETIC PROTEIN HETERODIMERS, COMPOSITIONS AND 
METHODS OF USE 

REKOMBINANTE KNOCHENMORPHOGENETISCHE PROTEIN HETERODIMERE, 
ZUSAMMENSETZUNGEN UND VERFAHREN 2UR VERWENDUNG 

PROTEINES HETERODIMERES MORPHOGENETIQUES D'OS DE RECOMBINAISON, 
COMPOSITIONS ET PROCEDES D' UTILISATION 



(84) Designated Contracting States: 

AT BE CH DE DK ES FR GB GR IE IT LI LU MC NL 
SE 

(30) Priority: 04.11.1991 US 787496 
07.04.1992 US 864692 

(43) Date of publication of application: 
31.08.1994 Bulletin 1994/35 

(73) Proprietor: Genetics Institute, LLC 
Cambridge, MA 02140 (US) 

(72) Inventors: 

• ISRAEL, David 
Concord, MA 01742 (US) 

♦ WOLFMAN, Neil, M. 
Dover, MA 02030 (US) 

(74) Representative: VOSSIUS & PARTNER 
Siebertstrasse 4 

81675 Munchen (DE) 



(56) References cited: 
EP-A- 0 433 225 
WO-A-89/10409 
WO-A-90/11366 
WO-A-93/00049 



WO-A-89/09787 
WO-A-90/03733 
WO-A-91/18098 
US-A- 4 923 805 



> PROCEEDINGS OF THE NATIONAL ACADEMY 
OF SCIENCES OF USA vol. 87, March 1990, 
WASHINGTON US pages 2220 - 2224 WANG, E.A. 
ET AL. 'Recombinant human bone 
morphogenetic protein induces bone formation' 
cited in the application 

JOURNAL OF BIOLOGICAL CHEMISTRY vol. 
265, no. 22, 5 August 1990, BALTIMORE, MD US 
pages 13198 - 13205 SAMPATH, T.K. ET AL. 
'Bovine osteogenic protein Is composed of 
dimersof OP-1 and BMP-2A,two members of the 
transforming growth factor-beta superfamily' 
PROCEEDINGS OF THE NATIONAL ACADEMY 
OF SCIENCES OF USA vol. 87, no. 24, December 
1990, WASHINGTON US pages 9843 - 9847 
CELESTE, A.J. ET AL. 'Identification of 
transforming growth factor- beta family 
members present in bone-inductive protein 
purified from bovine bone' 
JOURNAL OF CELLULAR BIOCHEMISTRY 
Supplement 16F, 1992, page 76, abstract W026; 
WOZNEY, J.M. ET AL.: 'Regulation of 
chondrogenesis and osteogenesis by the BMP 
proteins' 



Note: Within nine months from the publication of the mention of the grant of the European patent, any person may give 
notice to the European Patent Office of opposition to the European patent granted. Notice of opposition shall be filed in 
a written reasoned statement, it shall not be deemed to have been filed until the opposition fee has been paid. (Art. 
99(1) European Patent Convention). 



Printed by Jouve, 75001 PARIS (FR) 

BEST AVAILABLE COPY 



EP 0 612 348 B1 



Description 

[0001] The present invention relates to a series of novel recombinant heterodimeric proteins useful in the field of 
treating bone defects, healing bone injury and in wound healing in general. The invention also relates to methods for 
5 obtaining these heterodimers, methods for producing them by recombinant genetic engineering techniques, and com- 
positions containing them. 

[0002] In recent years, protein factors which are characterized by bone or cartilage growth inducing properties have 
been isolated and identified. See, e.g., U. S. Patent No. 5,013,649, PCT published application WO90/11366 PCT 
published application WO91/05802 and the variety of references cited therein. See, also, PCT/US90/05903 which 
10 discloses a protein sequence termed OP-1 , which is substantially similar to human BMP-7, and has been reported to 
have osteogenic activity. 

[0003] A family of individual bone morphogenetic proteins (BMPs), termed BMP-2 through BMP-9 have been isolated 
and identified. Reference for the purposes of providing disclosure of these proteins and methods of producing them is 
made to co-owned U.S. Patent No. 6, 1 50,328 and the related applications recited in its preamble. Of particular interest, 
15 are the proteins termed BMP-2 and BMP-4, disclosed in the above-referenced patent; BMP-7, disclosed in US 
5,141,905; BMP-5, disclosed in US 5,106,748 and BMP-6, disclosed in US 5,187,076 BMP-8 is disclosed in US 
5,688!678. Additional members of the BMP family include BMP-1 BMP-9, disclosed in US 5,116,738; and BMP-3, 
disclosed in US 5,116,738 and PCT publication 89/01464. 

[0004] There remains a need in the art for other proteins and compositions useful in the fields of bone and wound 
20 healing. 

[0005] In one aspect, the invention provides a method for producing a heterodimeric protein having bone stimulating 
activity comprising culturing a selected host eel! containing a nucleotide sequence encoding a first selected BMP or 
fragment thereof and a nucleotide sequence encoding a second selected BMP or fragment thereof, said nucleotide 
sequences each being under the control of a suitable regulatory sequence capable of directing co-expression of said 
25 proteins, and isolating said heterodimeric protein from the culture medium, wherein said heterodimeric protein is a 
human BMP-2/5, BMP-2/6, BMP-4/5, BMP-4/6 or BMP-4/7 heterodimer. 

[0006] According to one embodiment of this invention, the host cell may be co-transfected with one or more vectors 
containing coding sequences for one or more BMPs. Each BMP polynucleotide sequence may be present on the same 
vector or on individual vectors transfected into the cell. Alternatively, the BMPs or their fragments maybe incorporated 
30 into a chromosome of the host cell. Additionally, a single transcription unit may encode single copy of two genes 
encoding a different BMP. 

[0007] According to another embodiment of this invention, the selected host cell containing the two polypeptide 
encoding sequences is a hybrid cell line obtained by fusing two selected, stable host cells, each host cell transfected 
with, and capable of stably expressing, a polynucleotide sequence encoding a selected first or second BMP or fragment 
35 thereof. 

[0008] In another aspect of the present invention, therefore, there are provided recombinant heterodimeric proteins 
comprising a protein or fragment of a first BMP in association with a protein or fragment of a second BMP. The het- 
erodimer may be characterized by bone stimulating activity. The heterodimers comprise a protein or fragment of BMP- 
2 associated with a protein or fragment of either BMP-5 or BMP-6 ; or a protein or fragment of BMP-4 associated with 
40 a protein or fragment of either BMP-5, BMP-6 or BMP-7. These heterodimers may be produced by co-expressing each 
protein in a selected host cell and isolating the heterodimer from the culture medium. 

[0009] As a further aspect of this invention a cell line is provided which comprises a first polynucleotide sequence 
encoding a first BMP or fragment thereof and a second polynucleotide sequence encoding a second BMP or fragment 
thereof, the sequences being under control of one or more suitable expression regulatory systems capable of co- 

45 expressing the BMPs as a heterodimer wherein said recombinant heterodimeric protein is a human BMP-2/5, BMP- 
2/6, BMP-4/5, BMP-4/6 or BMP-4/7 heterodimer. The cell line may be transfected with one or more than one polynu- 
cleotide molecule. Alternatively, the cell line may be a hybrid cell line created by cell fusion as described above. 
[0010] Another aspect of the invention is a polynucleotide molecule or plasmid vector comprising a polynucleotide 
sequence encoding a first selected BMP or fragment thereof and a polynucleotide sequence encoding a second se- 

so lected BMP or fragment thereof wherein said first selected BMP is BMP-2 or BMP-4, and said second selected BMP 
is BMP-5 or BMP-6, or wherein said first selected BMP is BMP-4 and said second selected BMP is BMP-7. The se- 
quences are under the control of at least one suitable regulatory sequence capable of directing co-expression of each 
protein or fragment. The molecule may contain a single transcription unit containing a copy of both genes, or more 
than one transcription unit, each containing a copy of a single gene. 

55 [001 1 ] As still another aspect of this invention there is provided a method for producing a recombinant heterodimeric 
protein having bone stimulating activity in a prokaryotic cell comprising culturing a selected host cell containing a 
polynucleotide sequence encoding a first selected BMP or fragment thereof; culturing a second selected host cell 
containing a polynucleotide sequence encoding a second selected BM P or fragment thereof; isolating monomelic forms 
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of each BMP protein from the culture medium and co-assembling a monomer of the first protein with a monomer of 
the second protein wherein said heterodimeric protein is a human BMP-2/5, BMP-2/6, BMP-4/5, BMP-4/6 or BMP-4/7 
heterodimer. The first protein and the second protein are different BMPs. The resulting biologically active heterodimer 
is thereafter isolated from the mixture. Preferred cells are E. coli. 

[0012] Thus, as further aspects of this invention recombinant BMP heterodimers produced in eukaryotic cells are 
provided, as well as suitable vectors or plasmids, and selected transformed cells useful in such a production method. 
[0013] Other aspects and advantages of the present invention are described further in the following detailed descrip- 
tion of preferred embodiments of the present invention. 

Figure 1 provides the DNA and amino acid sequences of human BMP-2 (SEQ ID NOs: 1 and 2). 

Figure 2 provides the DNA and amino acid sequences of human BMP-4 (SEQ ID NOs: 3 and 4). 

Figure 3 provides the DNA and amino acid sequences of human BMP-7 (SEQ ID NOs: 5 and 6). 

Figure 4 provides the DNA and amino acid sequences of human BMP-6 (SEQ ID NOs: 7 and 8). 

Figure 5 provides the DNA and amino acid sequences of human BMP-5 (SEQ ID NOs: 9 and 10). 

Figure 6 provides the DNA and amino acid sequences of human BMP-8 (SEQ ID NOs: 11 and 12). 

Figure 7 provides the DNA sequence of vector pALB2-781 containing the mature portoin of the BMP-2 gene (SEQ 

ID NOs: 13 and 14). 

Figure 8 compares the activity of CHO BMP-2 and CHO BMP-2/7 in the W20 alkaline phosphatase assay. 

Figure 9 compares the activity of CHO BMP-2 and CHO BMP-2/7 in the BGP (osteocalcin) assay. 

Figure 10 provides a comparison of the W-20 activity of E. coli produced BMP-2 and BMP-2/7 heterodimer. 

Figure 11 depicts BMP-3 DNA and amino acid sequence. 

Figure 12 provides a comparison of BMP-2 and BMP-2/6 in the W-20 assay. 

Figure 13 provides a comparison of the in vivo activity of BMP-2/6 and BMP-2. 

Figure 14 provides a comparison of BMP-2, BMP-6 and BMP-2/6 in vivo activity. 

[001 4] The present invention provides a method for producing recombinant heterodimeric proteins having bone stim- 
ulating activity, as well as the recombinant heterodimers themselves, and compositions containing them for bone- 
stimulating or repairing therapeutic use. 

[0015] As used throughout this document, the term 'heterodimer* is defined as a biologically-active protein construct 
comprising the association of two different BMP protein monomers or active fragments thereof joined through at least 
one covalent, disulfide linkage. A heterodimer of this invention may be characterized by the presence of between one 
to seven disulfide linkages between the two BMP component strands. 

[0016] According to the present invention, therefore, a method for producing a recombinant BMP heterodimer ac- 
cording to this invention comprises culturing a selected host cell containing a polynucleotide sequence encoding a first 
selected BMP or a biologically active fragment thereof and a polynucleotide sequence encoding a second selected 
BMP or a fragment thereof. The resulting co-expressed, biologically active heterodimer is formed within the host cell, 
secreted therefrom and isolated from the culture medium. Preferred embodiments of methods for producing the het- 
erodimeric proteins of this invention, are described in detail below and in the following examples. Preferred methods 
of the invention involve known recombinant genetic engineering techniques [See, e.g., Sambrook et al, "Molecular 
Cloning. A Laboratory Manual.", 2d edition, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989)]. However, 
other methods, such as conventional chemical synthesis may also be useful in preparing a heterodimer of this invention. 
[0017] BMP heterodimers generated by this method are produced in a mixture of homodimers and heterodimers. 
This mixture of heterodimers and homodimers may be separated from contaminants in the culture medium by resort 
to essentially conventional methods, such as classical protein biochemistry or affinity antibody columns specific for 
one of the BMPs making up the heterodimer. Additionally, if desired, the heterodimers may be separated from ho- 
modimers in the mixture. Such separation techniques allow unambiguous determination of the activity of the het- 
erodimeric species. Example 4 provides one presently employed purification scheme for this purpose. 
[0018] The recombinant heterodimers of this invention produced by these methods involve the BMPs designated 
human BMP-2, human BMP-4, human BMP-5, human BMP-6 and human BMP-7 and are human BMP-2/5, BMP-2/6, 
BMP-4/5, BMP-4/6 or BMP-4/7 heterodimers. 

[0019] BMPs specifically identified above may be employed in heterodimers useful for veterinary, diagnostic or re- 
search use. 

[0020] Human BMP-2 is characterized by containing substantially the entire sequence, or fragments, of the amino 
acid sequence and DNA sequence disclosed in Figure 1 . Human BMP-2 proteins are further characterized as disulfide- 
linked dimers and homodimers of mature BMP-2 subunits. Recombinantly-expressed BMP-2 subunits include protein 
species having heterogeneous amino termini. One BMP-2 subunit is characterized by comprising amino acid #249 
(Ser) - #396 (Arg) of Figure 1 (SEQ (D NOs: 1 and 2). Another BMP-2 subunit is characterized by comprising amino 
acid #266 (Thr) - #396 (Arg) of Figure 1 . Another BMP-2 subunit is characterized by comprising amino acid #296 (Cys) 
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- #396 (Arg) of Figure 1 . A mature BMP-2 subunit is characterized by comprising amino acid #283 (Gin) - #396 (Arg) 
of Figure 1 . This latter subunit is the presently most abundant protein species which results from recombinant expres- 
sion of BMP-2 (Figure 1). However, the proportions of certain species of BMP-2 produced may be altered by manipu- 
lating the culture conditions. BMP-2 may also include modifications of the sequences of Figure 1, e.g., deletion of 
amino acids #241-280 and changing amino acid #245 Arg to He, among other changes. 

[0021] As described in detail in United States Patent No. 6,150,328 human BMP-2 may be produced by culturing a 
cell transformed with a DNA sequence comprising the nucleotide coding sequence from nucleotide #356 to #1543 in 
Figure 1 and recovering and purifying from the culture medium one or more of the above-identified protein species, 
substantially free from other proteinaceous materials with which it is co-produced. Human BMP-2 proteins are char- 
acterized by the ability to induce bone formation. Human BMP-2 also has in vitro activity in the W20 bioassay. Human 
BMP-2 is further characterized by the ability to induce cartilage formation. Human BMP-2 may be further characterized 
by the ability to demonstrate cartilage and/or bone formation activity in the rat bone formation assay described in the 
above-referenced application. 

[00221 Human BMP-4 is characterized by containing substantially the entire sequence, or fragments, of the amino 
acid sequence and DNA sequence disclosed in Figure 2 (SEQ ID NOs: 3 and 4). Human BMP-4 proteins are further 
characterized as disulfide-linked dimers and homodimers of mature BMP-4 subunits. Recombinantly-expressed BMP- 
4 subunits may include protein species having heterogeneous amino termini. A mature subunit of human BMP-4 is 
characterized by an amino acid sequence comprising amino acids #293 (Ser) - #408 (Arg) of Figure 2. Other ammo 
termini of BMP-4 may be selected from the sequence of Figure 2. Modified versions of BMP-4, including proteins further 
truncated at the amino or carboxy termini, may also be constructed by resort to conventional mutagenic techniques. 
[0023] As disclosed in patent US 6,150,328, BMP-4 may be produced by culturing a cell transformed with a DNA 
sequence comprising the nucleotide coding sequence from nucleotide #403 to nucleotide #1626 in Figure 2 and re- 
covering and purifying from the culture medium a protein containing the amino acid sequence from amino acid #293 
to #408 as shown in Figure 2, substantially free from other proteinaceous materials with which it is co-produced. BMP- 
4 proteins are capable of inducing the formation of bone. BM P-4 proteins are capable of inducing formation of cartilage. 
BMP-4 proteins are further characterized by the ability to demonstrate cartilage and/or bone formation activity in the 
rat bone formation assay. 

[0024] Human BMP-7 is characterized by containing substantially the entire sequence, or fragments, of the amino 
acid sequence and DNA sequence disclosed in Figure 3. Human BMP-7 proteins are further characterized as d.sulf. de- 
linked dimers and homodimers of mature BMP-7 subunits. Recombinantly-expressed BMP-7 subunits include protein 
species having heterogeneous amino termini. One BMP-7 subunit is characterized by comprising ammo acid #293 
(Ser) - #431 (His) of Figure 3 (SEQ ID NOs: 5 and 6). This subunit-is the most abundantly formed protein produced 
by recombinant expression of the BMP-7 sequence. Another BMP-7 subunit is characterized by comprising amino 
acids #300 (Ser) - #431 (His) of Figure 3. Still another BMP-7 subunit is characterized by comprising amino acids #31 6 
(Ala) - #431 (His) of Figure 3. Other amino termini of BMP-7 may be selected from the sequence of Figure 3. Similarly, 
modified versions, including proteins further truncated at the amino or carboxy termini, of BMP-7 may also be con- 
structed by resort to conventional mutagenic techniques. 

[0025] As disclosed in patent US 5,141 ,905, BMP-7 may be produced by culturing a cell transformed with a DNA 
sequence comprising the nucleotide coding sequence from nucleotide #97 to nucleotide #1389 in Figure 3 and recov- 
ering and purifying from the culture medium a protein containing the amino acid sequence from amino acid #293 to 
#431 as shown in Figure 3, substantially free from other proteinaceous or contaminating materials with which it is co- 
produced. These proteins are capable of stimulating, promoting, or otherwise inducing cartilage and/or bone formation. 
[0026] Human BMP-6 is characterized by containing substantially the entire sequence, or fragments, of the amino 
acid sequence and DNA sequence disclosed in Figure 4. Human BMP-6 proteins are further characterized as disulf de- 
linked dimers of mature BMP-6 subunits. Recombinantly-expressed BMP-6 subunits may include protein species hav- 
ing heterogeneous amino termini. One BMP-6 subunit is characterized by comprising amino acid #375 (Ser) - #513 
(His) of Figure 4 (SEQ ID NOs: 7 and 8). Other amino termini of BMP-6 may be selected from the sequence of Figure 
4. Modified versions, including proteins further truncated at the amino or carboxy termini, of BMP-6 may also be con- 
structed by resort to conventional mutagenic techniques. 

[0027] As described in detail in United States Patent No. 5,187,076 human BMP-6 may be produced by culturing a 
cell transformed with a DNA sequence comprising the nucleotide coding sequence from nucleotide #160 to #1698 in 
Figure 4 and recovering and purifying from the culture medium a protein comprising amino acid #375 to #51 3 of Figure 
4 substantially free from other proteinaceous materials or other contaminating materials with which it is co-produced. 
Human BMP-6 may be further characterized by the ability to demonstrate cartilage and/or bone formation activity in 
the rat bone formation assay. 

[0028] Human BMP-5 is characterized by containing substantially the entire sequence, or fragments, of the amino 
acid sequence and DNA sequence disclosed in Figure 5 (SEQ ID NOs: 9 and 1 0). Human BMP-5 proteins are further 
characterized as disulfide-linked dimers of mature BMP-5 subunits. Recombinantly-expressed BMP-5 subunits may 
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include protein species having heterogeneous amino termini, one BMP-5 subunit is characterized by comprising amino 
acid #329 (Ser) - #454 (His) of Figure 5. Other amino termini of BMP-5 may be selected from the sequence of Figure 
5. Modified versions, including proteins further truncated at the amino or carboxy termini, of BMP-5 may also be con- 
structed by resort to conventional mutagenic techniques. 
5 [0029] As described in detail in United States Patent No. 5,543,394 human BMP-5 may be produced by culturing a 
cell transformed with a DNA sequence comprising the nucleotide coding sequence from nucleotide #701 to #2060 in 
Figure 5 and recovering and purifying from the culture medium a protein comprising amino acid #329 to #454 of Figure 
5, substantially free from other proteinaceous materials or other contaminating materials with which it is co-produced. 
Human BMP-5 may be further characterized by the ability to demonstrate cartilage and/or bone formation activity in 

10 the rat bone formation assay described in the above-referenced application. 

[0030] Each above described BMP protein in its native, non-reduced dimeric form may be further characterized by 
an apparent molecular weight on a 12% Laemmlt gel ranging between approximately 28kD to approximately 40kD. 
Analogs or modified versions of the DNA and amino acid sequences described herein which provide proteins or active 
fragments displaying bone stimulating or repairing activity in the rat bone formation assay described below in Example 

is 9, are also classifed as suitable BMPs for use in this invention, further provided that the proteins or fragments contain 
one or more Cys residues for participation in disulfide linkages. Useful modifications of these sequences may be made 
by one of skill in the art with resort to known recombinant genetic engineering techniques. Production of these BMP 
sequences in mammalian cells produces homodimers, generally mixtures of homodimers having heterologous N ter- 
mini. Production of these BMP sequences in E. coli produces monomeric protein species. 

20 [0031] Thus, according to this invention one recombinant heterodimer of the present invention comprises the asso- 
ciation of a human BMP-2, including, e.g., a monomeric strand from a mature BMP-2 subunit as described above or 
an active fragment thereof, bound through one or up to seven covafent, disulfide linkages to a human BMP-5 including, 
e.g., a monomeric strand from a mature BMP-5 subunit as described above or an active fragment thereof. Another 
recombinant heterodimer of the present invention comprises the association of a human BMP-2, as described above, 

25 bound through one or up to seven covalent, disulfide linkages to a human BMP-6, including, e.g., a monomeric strand 
from a BMP-6 subunit as described above or an active fragment thereof. 

[0032] Still another recombinant heterodimer of the present invention comprises the association of a human BMP- 
4, including, e.g., a monomeric strand of a BMP-4 subunit as described above or an active fragment thereof, bound 
through one or up to seven covalent, disulfide linkages to a human BMP-5, as described above. Another recombinant 
30 heterodimer of the present invention comprises the association of a human BMP-4, as described above, bound through 
one or more covalent, disulfide linkages to a human BMP-6, as described above. Another recombinant heterodimer 
of the present invention comprises the association of a human BMP-4, as described above bound through one or more 
covalent, disulfide linkages to a human BMP-7, as described above. 

[0033] The disulfide linkages formed between the monomeric strands of the BMPs may occur between one Cys on 
35 each strand. Disulfide linkages may form between two Cys on each BMP. Disulfide linkages may form between three 
Cys on each BMP. Disulfide linkages may form between four Cys on each BMP Disulfide linkages may form between 
five Cys on each BMP. Disulfide linkages may form between six Cys on each BMP. Disulfide linkages may form between 
seven Cys on each BMP. These disulfide linkages may form between adjacent Cys on each BMP or between only 
selected Cys interspersed within the respective protein sequence. Various heterodimers having the same BMP com- 
40 ponent strands may form with different numbers of disulfide linkages. Various heterodimers having the same BMP 
component strands may form with disulfide bonds at different Cys locations. Different heterodimers encompassed by 
this invention having the same BMP components may differ based upon their recombinant production in mammalian 
cells, bacterial cells, insect or yeast cells. 

[0034] These recombinant heterodimers may be characterized by increased alkaline phosphatase activity in the W20 
45 mouse stromal cell line bioassay (Example 8) compared to the individual BMP homodimers, one strand of which forms 
each heterodimer. Further, these heterodimers are characterized by greater activity in the W20 bioassay than is pro- 
vided by simple mixtures of the individual BMP dimers. Preliminary characterization of heterodimers measured on the 
W20 bioassay have demonstrated that heterodimers of BMP-2 with BMP-5, BMP-6 or BMP-7 are very active. Similarly, 
heterodimers of BMP-4 with BMP-5, BMP-6 or BMP-7 are strongly active in the W20 bioassay. 
so [0035] Heterodimers of this invention may also be characterized by activity in bone growth and stimulation assays. 
For example, a heterodimer of this invention is also active in the rat bone formation assay described below in Example 
9. The heterodimers are also active in the osteocalcin bioassay described in Example 8. Other characteristics of a 
heterodimer of this invention include co-precipitation with anti-BMP antibodies to the two different constituent BMPs, 
as well as characteristic results on Western blots, high pressure liquid chromatography (HPLC) and on two-dimensional 
55 gels, with and without reducing conditions. 

[0036] One embodiment of the method of the present invention for producing recombinant BMP heterodimers in- 
volves culturing a suitable cell line, which has been co-transfected with a DNA sequence coding for expression of a 
first BMP or fragment thereof and a DNA sequence coding for expression of a second BMP or fragment thereof, under 
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the control of known regulatory sequences. The transformed host cells are cultured and the heterodimeric protein 
recovered and purified from the culture medium. ^h;™ 
[0037] In another embodiment of this method which is the presently preferred method of expression of the heterodm- 
ers of this invention, a single host cell, e.g., a CHO DUKX cell, is co-transfected with a f.rst DNA molecule containing 
a DNA sequence encoding one BMP and a second DNA molecule containing a DNA sequence encoding a second 
selected BMP. One or both plasmids contain a selectable marker that can be used to establish stable cell lines ex- 
pressing the BMPs. These separate plasmids containing distinct BMP genes on seperate transcription units are mixed 
and transfected into the CHO cells using conventional protocols. A ratio of plasmids that gives maximal expression of 
activity in the W20 assay, qenerally, 1:1, is determined. ^ n 

For example, as described in detail in Examp.e 3, equal ratios of a plasmid containing the first BMP and a 
dihydrofolate reductase (DHFR) marker gene and another plasmid containing a second BMP and a DHFR ^^ene 
car i be co-introduced into DHFR-deficient CHO cells, DUKX-BII. by calcium phosphate coprecipitation and transfection^ 
electroporation microinjection, protoplast fusion or lipofection. Individual DHFR expressing transformants are selected 
Tc Sn alpha media with dialyzed fetal calf serum by conventional means. DHFR+ cells conta.mng increased 
gene c^s can'be selected for propagation in increasing concentrations of metho*exate ^^SfS 
fn 0 02 0.1, 0.5 and 2.0 uM MTX) according to the procedures of Kaufman and Sharp, J. Mol. B ol„ 15J.60 -629 
(1982) and Kaufman et al, Mol. Cell Biol., 5:1750 (1983). Expression of the heterodimer or at least one BMPNnked 

0 DHFR should increase with increasing levels of MTX resistance. Cells that stably express either o both BMP/DHFR 
qenes will survive. However at a high frequency, cell lines stably incorporate and express both plasmids that were 
pZ^dun^g the initial transfection The conditioned medium is thereafter harvested and the heterodimer isolated 
S conventional methods and assayed for activity. This approach can be employed with be 
[0039] As an alternative embodiment of this method, a DNA molecule containing one selected BMP 9™ may be 

ransfected into a stable cel. line which already expresses another selected BMP gene. For example 

detail in Example 3 below, a stable CHO cell line expressing BMP-7 with the DHFR marker (designated 7MB9) [Genetics 
Institute, Inc] is transfected with a plasmid containing BMP-2 and a second selectable marker gene °-9^"^'" 
resistance (Neo). After transfection, the cell is cultured and suitable cells selected by treatment with MTX .and the 
antibiotic, G-41 8 Surviving cells are then screened for the expression of the heterodimer. This expression system has 
the advantage of permitting a single step selection. 

[0040] Alternative dual selection strategies using different cell lines or different markers can al so be use d. Fo ex- 
ample the use of an adenosine deaminase (ADA) marker to amplify the second BMP gene ,n a stable CHO cell line 
expressing a different BMP with the DHFR marker may be preferable, since the level of expression can be increased 
using deoxycoformycin (DCR-mediated gene amplification. (See the ADA containing plasmid described in Example 

vector containing a distinct marker and selected for dual resistance and BMP coexpression trQncfo „ tinn 
[0041] Still another embodiment of a method of expressing the heterodimers of this mvention includes transfec ng 
the host cell with a single DNA molecule encoding multiple genes for expression either on a single transcription , unit 
or on separate transcription units. Multicistronic expression involves multiple polypeptides encoded I with mas ngle 
transcript, which can be efficiently translated from vectors utilizing a leader sequence, eg., from the EMC virus .from 
poliovirus or from other conventional sources of leader sequences. Two BMP genes and a 

expressed wtthin a single transcription unit. For example, vectors containing the configuration B ^^ n B ^^ R 
FR or BMPx-EMC-BMPy-EMC-DHFR can be transfected into CHO cells and selected and amplified using the DHFR 
marker A plasmid may be constructed which contains DNA sequences encoding two different BMPs. one or more 
marker genes and a suitable leader or regulatory sequence on a single transcription unit. 

[0042] Similarly, host cells may be transfected with a single plasmid which contains separate transcnpt on units for 
each BMP. A selectable marker, e.g., DHFR, can be contained on a another transcription unit, or alt ernative^ as the 
second cistron on one or both of the BMP genes. These plasmids may be transfected into a selected host ceH for 
expression of the heterodimer, and the heterodimer isolated from the cells or culture medium as described above. 
[0043] Another embodiment of this expression method involves cell fusion. Two stable cell lines i which express se- 
eded BMPs, such as a cell line expressing BMP-2 (e.g., 2EG5) and a cell line expressing BMP-7 (*g fMM). de- 
veloped using the DHFR/MTX gene amplification system and expressing BMP at high levels, as described ,n Example 

1 and in the above incorporated U.S. applications, can be transfected with one of several dominant marker genes £ 
g neo". hygromycin", GPT). After sufficient time in coculture (approximately one day) one resultant cell line expressing 
one BMP and a dominant marker can be fused with a cell line expressing a different BMP and preferably a different 
marker using a fusigenic reagent, such as polyethylene glycol, Sendai virus or other known agent. 

[0044] The resulting cell hybrids expressing both dominant markers and DHFR can be selected using the eppn^nate 
culture conditions, and screened for coexpression of the BMPs or their fragments. The selected hybrid eel contains 
sequences encoding both selected BMPs, andthe heterodimer is formed in the cell and then secreted. The heterodimer 
is obtained from the conditioned medium and isolated and purified therefrom by conventional methods (see e.g.. Ex- 
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ample 4). The resulting heterodimer may be characterized by methods described herein. 

[0045] Cell lines generated from the approaches described above can be used to produce co-expressed, het- 
erodimeric BMP polypeptides. The heterodimeric proteins are isolated from the cell medium in a form substantially 
free from other proteins with which they are co-produced as well as from other contaminants found in the host cells by 

5 conventional purification techniques. The presently preferred method of production is co-transfection of different vectors 
into CHO cells and methotrexate-mediated gene amplification. Stable cell lines may be used to generate conditioned 
media containing recombinant BMP that can be purified and assayed for in vitro and in vivo activities. For example, 
the resulting heterodimer-p reducing cell lines obtained by any of the methods described herein may be screened for 
activity by the assays described in Examples 8 and 9, RNA expression, and protein expression by sodium dodecyl 

10 sulfate poiyacrylamide gel electrophoresis (SDS-PAGE) . 

[0046] The above-described methods of co-expression of the heterodimers of this invention utilize suitable host cells 
or cell lines. Suitable cell preferably include mammalian cells, such as Chinese hamster ovary cells (CHO). The se- 
lection of suitable mammalian host cells and methods for transformation, culture, amplification, screening and product 
production and purification are known in the art. See, e.g., Gething and Sambrook, Nature , 293:620-625 (1981), or 

15 alternatively, Kaufman et al, Mol. Cell. Biol. , 5(7): 1750-1 759 (1985) or Howley et al, U. S. Patent 4,419,446. Other 
suitable mammalian cell lines are the CV-1 cell line, BHK cell lines and the 293 eel! line. The monkey COS-1 cell line 
is presently believed to be inefficient in BMP heterodimer production. 

[0047] Many strains of yeast cells known to those skilled in the art may also be available as host cells for expression 
of the polypeptides of the present invention, e.g., Saccharomyces cerevisiae. Additionally, where desired, insect cells 
20 may be utilized as host cells in the method of the present invention. See, e.g., Miller et al, Genetic Engineering , 8: 
277-298 (Plenum Press 1986) and references cited therein. 

[0048] Another method for producing a biologically active heterodimeric protein of this invention may be employed 
where the host cells are microbial, preferably bacterial cells, in particular E. coli. For example, the various strains of 
E. coli (e.g., HB101, MC1061) are well-known as host cells in the field of biotechnology. Various strains of B. subtilis , 

25 Pseudomonas, other bacilli and the like may also be employed in this method. 

[0049] This method, which may be employed to produce monomers and dimers (both homodimers and heterodimers) 
is described in European Patent Application No. 433,225. Briefly, this process involves culturing a microbial host com- 
prising a nucleotide sequence encoding the desired BMP protein linked in the proper reading frame to an expression 
control sequence which permits expression of the protein and recovering the monomelic, soluble protein. Where the 

30 protein is insoluble in the host cells, the water-insoluble protein fraction is isolated from the host cells and the protein 
is solubilized. After chromatographic purification, the solubilized protein is subjected to selected conditions to obtain 
the biologically active dimeric configuration of the protein. This process, which may be employed to produce the het- 
erodimers of this invention, is described specifically in Example 7, for the production of a BMP-2 homodimer. 
[0050] Another aspect of the present invention provides DNA molecules or plasmid vectors for use in expression of 

35 these recombinant heterodimers. These plasmid vectors may be constructed by resort to known methods and available 
components known to those of skill in the art. In general, to generate a vector useful in the methods of this invention, 
the DNA encoding the desired BMP protein is transferred into one or more appropriate expression vectors suitable for 
the selected host cell. 

[0051] It is presently contemplated that any expression vector suitable for efficient expression in mammalian cells 
40 may be employed to produce the recombinant heterodimers of this invention in mammalian host celts. Preferably the 
vectors contain the selected BMP DNA sequences described above and in the Figures, which encode selected BMP 
components of the heterodimer. Alternatively, vectors incorporating modified sequences as described in the above- 
referenced patent applications are also embodiments of the present invention and useful in the production of the vec- 
tors. 

45 [0052] In addition to the specific vectors described in Example 1, one skilled in the art can construct mammalian 
expression vectors by employing the sequence of Figures 1 -6 or other DNA sequences containing the coding sequenc- 
es of Figures 1-6 (SEQ ID NOs: 1, 3, 5, 7, 9 and 11), or other modified sequences and known vectors, such as pCD 
[Okayama et al, Mol. Cell Biol. , 2: 161-1 70 (1 982)] and pJL3, pJL4 [Gough et al, EMBO J. , 4:645-653 (1 985)]. The BMP 
DNA sequences can be modified by removing the non-coding nucleotides on the 5' and 3' ends of the coding region. 

so The deleted non-coding nucleotides may or may not be replaced by other sequences known to be beneficial for ex- 
pression. The transformation of these vectors into appropriate host cells as described above can produce desired 
heterodimers. 

[0053] One skilled in the art could manipulate the sequences of Figures 1 -6 by eliminating or replacing the mammalian 
regulatory sequences flanking the coding sequence with e.g., yeast or insect regulatory sequences, to create vectors 
55 for intracellular or extracellular expression by yeast or insect cells. [See, e.g., procedures described in published Eu- 
ropean Patent Application 155,476] for expression in insect cells; and procedures described in published PCT appli- 
cation WO86/00639 and European Patent Application EPA 123,289 for expression in yeast cells]. 
[0054] Similarly, bacterial sequences and preference codons may replace sequences in the described and exempli- 
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tied mammalian vectors to create suitable expression systems for use in the production of BMP monomers in the 
method described above. For example, the coding sequences could be further manipulated (e.g., ligated to other known 
linkers or modified by deleting non-coding sequences therefrom or altering nucleotides therein by other known tech- 
niques) The modified BMP coding sequences could then be inserted into a known bacterial vector using procedures 
such as described in T. Taniguchi et al, Proc. Natl. Acad. Sci. USA, 77:5230-5233 (1980). The exemplary bacterial 
vector could then be transformed into bacterial host cells and BMP heterodimers expressed thereby. An exemplary 
vectorfor microbial, e.g., bacterial, expression is described below in Example 7. 

[0055] Other vectors useful in the methods of this invention may contain multiple genes in a single transcription unit. 
For example, a proposed plasmid p7E2D contains the BMP-7 gene followed by the EMC leader sequence, followed 
by the BMP-2 gene, followed by the DHFR marker gene. Another example is plasmid p7E2ED which contains the 
BMP-7 gene, the EMC leader, the BMP-2 gene, another EMC leader sequence and the DHFR marker gene. Alterna- 
tively the vector may contain more than one transcription unit. As one example, the plasmid p2ED7ED contains a 
transcription unit for BMP-2 and a separate transcription unit for BMP-7, i.e., BMP-2-EMC-DHFR and BM P-7-EMC-DH- 
FR. Alternatively, each transcription unit on the plasmid may contain a different marker gene. For example, plasmid 
p2EN7ED contains BMP-2-EMC-Neo and BMP-7-EMC-DHFR. 

[0056] Additionally the vectors also contain appropriate expression control sequences which are capable of directing 
the replication and expression of the BMP in the selected host cells. Useful regulatory sequences for such vectors are 
known to one of skill in the art and may be selected depending upon the selected host cells. Such selection is routine 
and does not form part of the present invention. Similarly, the vectors may contain one or more selection markers, such 
as the antibiotic resistance gene, Neo or selectable markers such as DHFR and ADA. The presently preferred marker 
gene is DHFR. These marker genes may also be selected by one of skill in the art. 

[0057] Once they are expressed by one of the methods described above, the heterodimers of this invention may be 
identified and characterized by application of a variety of assays and procedures. A co-precipitation (immunoprecipi- 
tation) assay may be performed with antibodies to each of the BMPs forming the heterodimer. Generally antibodies 
for this use may be developed by conventional means, e.g., using the selected BMP, fragments thereof, or synthetic 
BMP peptides as antigen. Antibodies employed in assays are generally polyclonal antibodies made from individual 
BMP peptides or proteins injected into rabbits according to classical techniques. This assay is performed conventionally, 
and permits the identification of the heterodimer, which is precipitated by antibodies to both BMP components of the 
heterodimer. In contrast, only one of the two antibodies causes precipitation of any homodimeric form which may be 
produced in the process of producing the heterodimer. 

[0058] Another characterizing assay is a Western assay, employing a precipitating antibody, a probing antibody and 
a detecting antibody. This assay may also be performed conventionally, by using an antibody to one of the BMPs to 
precipitate the dimers, which are run on reducing SDS-PAGE for Western analysis. An antibody to the second BMP is 
used to probe the precipitates on the Western gel for the heterodimer. A detecting antibody, such as a goat-antirabbit 
antibody labelled with horseradish peroxidase (HRP), is then applied, which will reveal the presence of one of the 
component subunits of the heterodimer. 

[0059] Finally, the specific activity of the heterodimer may be quantitated as described in detail in Example 6. Briefly, 
the amount of each BMP is quantitated using Western blot analysis or pulse labelling and SDS-PAGE analysts in 
samples of each BMP homodimer and the heterodimer. The W20 activity is also determined as described specifically 
in Example 8. The relative specific activities may be calculated by the formula: W20 alkaline phosphatase activity/ 
amount of BMP on Western blot or by fluorography. As one example, this formula has been determined for the BMP- 
2/7 heterodimer, demonstrating that the heterodimer has an estimated 5 to 50 fold higher specific activity than the 
BMP-2 homodimer. 

[0060] The heterodimers of the present invention may have a variety of therapeutic and pharmaceutical uses, e.g., 
in compositions for wound healing, tissue repair, and in similar compositions which have been indicated for use of the 
individual BMPs. Increased potency of the heterodimers over the individual BMPs may permit lower dosages of the 
compositions in which they are contained to be administered to a patient in comparison to dosages of compositions 
containing only a single BMP. A heterodimeric protein of the present invention, which induces cartilage and/or bone 
growth in circumstances where bone is not normally formed, has application in the healing of bone fractures and 
cartilage defects in humans and other animals. Such a preparation employing a heterodimeric protein of the invention 
may have prophylactic use in closed as well as open fracture reduction and also in the improved fixation of artificial 
joints. De novo bone formation induced by an osteogenic agent contributes to the repair of congenital, trauma induced, 
or oncologic resection induced craniofacial defects, and also is useful in cosmetic plastic surgery. 
[0061] A heterodimeric protein of this invention may be used in the treatment of periodontal disease, and in other 
tooth repair processes. Such agents may provide an environment to attract bone-forming cells, stimulate growth of 
bone-forming cells or induce differentiation of progenitors of bone-forming cells. Heterodimeric polypeptides of the 
invention may also be useful in the treatment of osteoporosis. A variety of osteogenic, cartilage-inducing and bone 
inducing factors have been described. See, e.g., European Patent Applications 148,155 and 169,016 for discussions 
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thereof. 

[0062] The proteins of the invention may also be used in wound healing and related tissue repair. The types of 
wounds include, but are not limited to burns, incisions and ulcers. (See, e.g., PCT Publication WO84/01106 for dis- 
cussion of wound healing and related tissue repair). 
5 [0063] Additionally, the proteins of the invention may increase neuronal survival and therefore be useful in transplan- 
tation and treatment of conditions exhibiting a decrease in neuronal survival. 

[0064] In view of the usefulness of the heterodimers, therefore, a further aspect of the invention is a therapeutic 
composition for repairing fractures and other conditions related to cartilage and/or bone defects or periodontal diseases. 
In addition, the invention comprises therapeutic compositions for wound healing and tissue repair. Such compositions 

10 comprise a therapeutically effective amount of a heterodimeric protein of the invention in admixture with a pharmaceu- 
tically acceptable vehicle, carrier or matrix. The preparation and formulation of such physiologically acceptable protein 
compositions, having due regard to pH, isotonicity, stability and the like, is within the skill of the art. 
[0065] It is expected that the proteins of the invention may act in concert with other related proteins and growth 
factors. Therapeutic compositions of the invention therefore comprise a therapeutic amount of a heterodimeric protein 

15 of the invention with a therapeutic amount of at least one of the other BMP proteins disclosed in co-owned and con- 
currently filed U.S. applications described above. Such combinations may comprise separate molecules of the BMP 
proteins or other heteromolecules of the present invention. 

[0066] In further compositions, heterodimeric proteins of the invention may be combined with other agents beneficial 
to the treatment of the bone and/or cartilage defect, wound, or tissue in question. These agents include various growth 
20 factors such as epidermal growth factor (EGF), platelet derived growth factor (PDGF), transforming growth factors 
(TGF-ot and TGF-p), and insulin-like growth factor (IGF). 

[0067] The therapeutic compositions are also presently valuable for veterinary applications due to the lack of species 
specificity in BMP proteins. Particularly domestic animals and thoroughbred horses, in addition to humans, are desired 
patients for such treatment with heterodimeric proteins of the present invention. 

25 [0068] The composition is to be administered topically, systematically, or locally as an implant or device. When ad- 
ministered, the therapeutic composition for use in this invention is, of course, in a pyrogen-free, physiologically accept- 
able form. Further, the composition may desirably be encapsulated or injected in a viscous form for delivery to the site 
of bone, cartilage or tissue damage. Topical administration may be suitable for wound healing and tissue repair. Ther- 
apeutically useful agents other than the heterodimeric proteins of the invention which may also optionally be included 

30 in the composition as described above, may alternatively or additionally, be administered simultaneously or sequentially 
with the heterodimeric BMP composition in the methods of the invention. Preferably for bone and/or cartilage formation, 
the composition would include a matrix capable of delivering the heterodimeric protein-containing composition to the 
site of bone and/or cartilage damage, providing a structure for the developing bone and cartilage and optimally capable 
of being resorbed into the body. Such matrices may be formed of materials presently in use for other implanted medical 

35 applications. 

[0069] The choice of matrix material is based on biocompatibility, biodegradability, mechanical properties, cosmetic 
appearance and interface properties. The particular application of the heterodimeric BMP compositions will define the 
appropriate formulation. Potential matrices for the compositions maybe biodegradable and chemically defined calcium 
sulfate, tricalciumphosphate, hydroxy apatite, polylactic acid, potyglycolic acid and polyanhydrides. Other potential ma- 

40 terials are biodegradable and biologically well defined, such as bone or dermal collagen. Further matrices are comprised 
of pure proteins or extracellular matrix components. Other potential matrices are nonbiodegradable and chemically 
defined, such as sintered hydroxy apatite, bioglass, aluminates, or other ceramics. Matrices may be comprised of com- 
binations of any of the above mentioned types of material, such as polylactic acid and hydroxyapatite or collagen and 
tricalciumphosphate. The bioceramics may be altered in composition, such as in calcium-aluminate-phosphate and 

45 processing to alter pore size, particle size, particle shape, and biodegradability. 

[0070] Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and glycolic acid in the form of porous 
particles having diameters ranging from 150 to 800 microns. In some applicatons, it will be useful to utilize a seques- 
tering agent, such as carboxymethyl cellulose or autologous blood clot, to prevent the BMP compositions from dissas- 
sociating from the matrix. 

50 [0071 ] The dosage regimen of a heterodimeric protein-containing pharmaceutical composition will be determined by 
the attending physician considering various factors which modify the action of the heterodimeric proteins, e.g. amount 
of bone weight desired to be formed, the site of bone damage, the condition pf the damaged bone, the size of a wound, 
type of damaged tissue, the patient's age, sex, and diet, the severity of any infection, time of administration and other 
clinical factors. The dosage may vary with the type of matrix used in the reconstitution and the BMP proteins in the 

55 heterodimer and any additional BMP or other proteins in the pharmaceutical composition. For example, the addition 
of other known growth factors, such as IGF I (insulin like growth factor I), to the final composition, may also effect the 
dosage. Progress can be monitored by periodic assessment of bone growth and/or repair, for example, X-rays, histo- 
morphometric determinations and tetracycline labeling. 
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[0072] The following examples are illustrative of the present invention and do not limit its scope. 

[0073] They encompass BMP monomers and heterodimers not included within the scope of protection of the present 

invention, which, however, are maintained for illustration and enablement purposes. 

5 EXAMPLE 1 - BMP Vector constructs and Cell Lines 

A. BMP-2 Vectors 

[0074] The mammalian expression vectorpMT 2 CXM is a derivative of p91 023 (b) [Wong et al, Science, 228:81 0-815 
w (1985)] differing from the latter in that it contains the ampicillin resistance gene (Amp) in place of the tetracycline 
resistance gene (Tet) and further contains a Xhol site for insertion of cDNA clones. The functional elements of pMT2 
CXM have been described [R. J. Kaufman, Proc. Natl. Acad. Sci. USA, 82:689-693 (1 985)] and include the adenovirus 
VA genes, the SV40 origin of replication including the 72 bp enhancer, the adenovirus major late promoter including a 
5' splice site and the majority of the adenovirus tripartite leader sequence present on adenovirus late mRNAs, a 3' 
15 splice acceptor site, a DHFR insert, the SV40 early polyadenylation site (SV40), and pBR322 sequences needed for 
propagation in E. coli. 

[0075] EcoRI digestion of pMT2-VWF, which has been deposited with the American Type Culture Collection (ATCC), 
Rockville, MD (USA) under accession number ATCC 67122, excises the cDNA insert present in pMT2-VWF, yielding 
pMT2 in linearform. Plasmid pMT2 can be ligated and used to transform E. coli HB 1 01 or DH-5 to ampicillin resistance. 
20 Plasmid pMT2 DNA can be prepared by conventional methods. 

[0076] Plasmid pMT2 CXM is then constructed using loopout/in mutagenesis [Morinaga et al, Biotechnology , 84:636 
(1984)]. This removes bases 1075 to 1145 relative to the Hindlll site near the SV40 origin of replication and enhancer 
sequences of pMT2. In addition it inserts the following sequence: 

25 

5' P0 4 — CATGGGCAGCTCGAG— 3 ' (SEQ ID NO: 15) 

at nucleotide 1145. This sequence contains the recognition site for the restriction endonuclease Xhol. 
30 [0077] A derivative of pMT2 CXM, termed plasmid pMT23, contains recognition sites for the restriction endonucle- 
ases Pstl, EcoRI, Sail and Xhol. 

[0078] Full length BMP-2 cDNA (Fig. 1) (SEQ ID NO: 1) is released from the A.GT10 vector by digestion with EcoRI 
and sub-cloned into pSP65 [Promega Biotec, Madison, Wisconsin; see, e.g., Melton et al, Nucl. Acids Res. , 12: 
7035-7056 (1984)] in both orientations yielding pBMP-2 #39-3 orpBMP-2 #39-4. 
35 [0079] The majority of the untranslated regions of the BMP-2 cDNA are removed in the following manner. The 5* 
sequences are removed between the Sail site in the adapter (present from the original cDNA cloning) and the Sail site 
7 base pairs upstream of the initiator ATG by digestion of the pSP65 plasmid containing the BMP-2 cDNA with Sail 
and religation. The 3' untranslated region is removed using heteroduplex mutagenesis using the oligonucleotide 

40 

5' GAGGGTTGTGGGTGTCGC TAGT GA GTCGACT ACAGCAAAATT 3'. 

End Sail 

(SEQ ID NO: 16) 

45 

The sequence contains the terminal 3' coding region of the BMP-2 cDNA, followed immediately by a recognition site 
for Sail. The sequence introduces a Sail site following the termination (TAG) codon. 

[0080] The Sail fragment of this clone was subcloned into the expression vector pMT23, yielding the vector 
pMT23-BMP2AUT. Restriction enzyme sites flank the BMP-2 coding region in the sequence Pstl-EcoRI-Sall-BMP-2 

so cDHA-Sall-EcoRI-Xhol. 

[0081] The expression plasmid pED4 [Kaufman et al, Nucl. Acids Res., 19:4485-4490 (1991)] was linearized by 
digestion with EcoRI and treated with calf intestinal phosphatase. The BMP-2 CDNA gene was excised from 
pMT23-BMP2AUT by digestion with EcoRI and recovery of the 1 .2 kb fragment by electrophoresis through a 1 .0% low 
melt agarose gel. The linearized pED4 vector and the EcoRI BMP-2 fragment were ligated together, yielding the BMP- 

55 2 expression plasmid pBMP2A-EMC. 

[0082] Another vector pBMP-2A-EN contains the same sequences contained within the vector pBMP2A-EMC, except 
the DHFR gene has been replaced by conventional means with the neomycin resistance gene from the Tn5 transpos- 
able element. 
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B. BMP-4 Vectors 

[0083] A BMP-4 cDNA sequence set forth in Figure 2 (SEQ ID NO: 3), in which the 3' untranslated region is removed, 
is made via heteroduplex mutagenesis with the mutagenic oligonucleotide: 

5 

5 ' GGATGTGGGTGCCG CTGA CTCTAGAGTCGAC GGAATTC 3 ' 

End EcoRI 

(SEQ ID NO: 17) 

w 

This deletes all of the sequences 3' to the translation terminator codon of the BMP-4 cDNA, juxtaposing this terminator 
codon and the vector polylinker sequences. This step is performed in an SP65 vector [Promega Biotech] and may also 
be conveniently performed in pMT2-derivatives containing the BMP-4 cDNA similar to the BMP2 vectors described 
15 above. The 5' untranslated region is removed using the restriction endonuclease Bsml, which cleaves within the eighth 
codon of BMP-4 cDNA. 

[0084] Reconstruction of the first eight codons is accomplished by ligation to oligonucleotides: 

20 EcoRI Initiator Bsml 

5' AATTCA CCATGATTCCTGGTAACC gAATGCT 3' (SEQ ID NO: 18) 



and 

25 

3' GTGGTACTAAGGACCATTGGCTTAC 5' (SEQ ID NO: 19) 

30 These oligonucleotides form a duplex which has a Bsml complementary cohesive end capable of ligation to the Bsml 
restricted BMP-4 cDNA, and it has an EcoRI complementary cohesive end capable of ligation to the EcoRI restricted 
vector pMT2. Thus the cDNA for BM P-4 with the 5' and 3' untranslated regions deleted, and retaining the entire encoding 
sequence is contained within an EcoRI restriction fragment of approximately 1 .2 kb. 

[0085] The pMT2 CXM plasmid containing this BMP-4 sequence is designated pXMBMP-4AUT. It is digested with 
35 EcoRI in order to release the BMP-4 cDNA containing insert from the vector. This insert is subcloned into the EcoRI 
site of the mammalian expression vector pED4, resulting pBMP4A-EMC. 

C. BMP-5 Vectors 

40 [0086] A BMP-5 cDNA sequence comprising the nucleotide sequence from nucleotide #699 to #2070 of Fig. 5 (SEQ 
ID NO: 9) is specifically amplified as follows. The oligonucleotides CGACCTGCAGCCACCATGCATCTGACTGTA 
(SEQ ID NO: 20) and TGCCTGCAGTTTAATATTAGTGGCAGC (SEQ ID NO: 21) are utilized as primers to allow the 
amplification of nucleotide sequence #699 to #2070 of Fig. 5 from the BMP-5 insert of A.-ZAP clone U2-16 [ATCC 
#68109]. This procedure introduces the nucleotide sequence CGACCTGCAGCCACC (SEQ ID NO: 22) immediately 

45 preceeding nucleotide #699 and the nucleotide sequence CTGCAGGCA immediately following nucleotide #2070. The 
addition of these sequences results in the creation of Pstl restriction endonuclease recognition sites at both ends of 
the amplified DNA fragment. The resulting amplified DNA product of this procedure is digested with the restriction 
endonuclease Pstl and subcloned into the Pstl site of the pMT2 derivative pMT21 [Kaufman, Nucl. Acids Res. , 19: 
4485-4490 (1991)]. The resulting clone is designated H5/5/pMT. 

so [0087] The insert of H5/5/pMT is excised by Pstl digestion and subcloned into the plasmid vector pSP65 [Promega 
Biotech] at the Pstl site, resulting in plasmid BMP5/SP6. BMP5/SP6 and U2-16 are digested with the restriction endo- 
nucleases Nsil and Ndel to excise the portion of their inserts corresponding to nucleotides #704 to #1876 of Fig. 5. 
The resulting 1173 nucleotide Nsil-Ndel fragment of clone U2-16 is ligated into the Nsii-Ndel site of BMP5/SP6 from 
which the corresponding 1173 nucleotide Nsil-Ndel fragment had been removed. The resulting clone is designated 

55 BMP5mix/SP65. 

[0088] Direct DNA sequence analysis of BMP5mix/SP65 is performed to confirm identity of the nucleotide sequences 
produced by the amplification to those set forth in Fig. 5. The clone BMP5mix/SP65 is digested with the restriction 
endonuclease Pstl resulting in the excision of an insert comprising the nucleotides #699 to #2070 of Fig. 5 and the 
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additional sequences containing the Pstl recognition sites as described above. The resulting 1382 nucleotide Psti 
fragment is subcloned into the Pstl site of the pMT2 derivative pMT21 . This clone is designated BMP5mix/pMT21#2. 
[0089] The same fragment is also subcloned into the Psti site of pED4 to yield the vector designated BM P5mix-EMC- 
11. 

5 

D. BMP-6 Vectors 

[0090] A BMP-6 cDNA sequence comprising the nucleotide sequence from nucleotide #1 60 to #1 706 of Fig. 4 (SEQ 
ID NO: 7) is produced by a series of techniques known to those skilled in the art. The clone BMP6C35 [ATCC 68245] 
10 is digested with the restriction endonucleases Apal and Taql, resulting in the excision of a 1476 nucleotide portion of 
the insert comprising nucleotide #231 to #1703 of Fig. 4. Synthetic oligonucleotides with Sail restriction endonuclease 
site converters are designed to replace those nucleotides corresponding to #160 to #230 and #1704 to #1706 which 
are not contained in the 1476 Apal-Taql fragment of the BMP-6 cDNA sequence. 

[0091] Oligonucleotide/Sali converters conceived to replace the missing 5' (TCGACCCACCATGCCGGGGCT 
15 GGGGCGGAGGGCGCAGTGGCTGTGCTGGTGGTGGGGGCTGTGCTGCAGCTGCTGCGGGGCC (SEQ ID NO: 
23) and CGCAGCAGCTGCACAGCAGCCCCCACCACCAGCACAGCCACTGCGCCCTCCGCCCCAGCCCCGGCAT 
GGTGGG) (SEQ ID NO: 24) and 3' (TCGACTGGTTT and (SEQ ID NO: 25) and CGAAACCAG (SEQ ID NO: 26) ) 
sequences are annealed to each other independently. The annealed 5' and 3' converters are then ligated to the 1476 
nucleotide Apal-Taql described above, creating a 1563 nucleotide fragment comprising the nucleotide sequence from 
20 #1 60 to #1 706 of Fig. 4 and the additional sequences contrived to create Sail restriction endonuclease sites at both 
ends. The resulting 1563 nucleotide fragment is subcloned into the Sail site of pSP64 [Promega Biotech, Madison, 
Wl]. This clone is designated BMP6/SP64#15. 

[0092] DN A sequence analysis of BMP6/SP64#1 5 is performed to confirm identity of the 5' and 3' sequences replaced 
by the converters to the sequence set forth in Fig. 4. The insert of BMP6/SP64#15 is excised by digestion with the 
25 restriction endonuclease Sail. The resulting 1 563 nucleotide Sail fragment is subcloned into the Xhol restriction endo- 
nuclease site of pMT21 and designated herein as BMP6/pMT21 . 

[0093] The Pstl site of pED4 is converted to a Sail site by digestion of the plasmid with Pstl and ligation to the 
converter oligonucleotides: 

30 

5'-TCGACAGGCTCGCCTGCA-3' (SEQ ID NO: 27) 

and 

35 

3 ' -GTCCGAGCGG-5 ' (SEQ ID NO: 28). 

40 The above 1 563 nucleotide Sail fragment is also subcloned into the Sail site of this pED4 vector, yielding the expression 
vector BMP6/EMC. 

E. BMP-7 Vectors 

45 [0094] A BMP-7 sequence comprising the nucleotide sequence from nucleotide #97 to #1 402 of Fig. 3 (SEQ ID NO: 
5) is specifically amplified as follows. The oligonucleotides CAGGTCGACCCACCATGCACGTGCGCTCA (SEQ ID NO: 
29) and TCTGTCGACCTCGGAGGAGCTAGTGGC (SEQ ID NO: 30) are utilized as primers to allow the amplification 
of nucleotide sequence #97 to #1402 of Fig. 3 from the insert of clone PEH7-9 [ATCC #68182]. This procedure gen- 
erates the insertion of the nucleotide sequence CAGGTCGACCCACC immediately preceeding nucleotide #97 and the 

so insertion of the nucleotide sequence GTCGACAGA immediately following nucleotide #1402. The addition of these 
sequences results in the creation of a Sail restriction endonuclease recognition site at each end of the amplified DNA 
fragment. The resulting amplified DNA product of this procedure is digested with the restriction endonuclease Sail and 
subcloned into the Sail site of the plasmid vector pSP64 [Promega Biotech, Madison, Wl] resulting in BMP7/SP6#2. 
[0095] The clones BM P7/SP6#2 and PEH7-9 are digested with the restriction endonucleases Ncol and Stul to excise 

55 the portion of their inserts corresponding to nucleotides #363 to #1 081 of Fig. 3. The resulting 71 9 nucleotide Ncol-Stul 
fragment of clone PEH7-9 is ligated into the Ncol-Stul site of BMP7/SP6#2 from which the corresponding 71 9 nucleotide 
fragment is removed. The resulting clone is designated BMP7mix/SP6. 

[0096] Direct DNA sequence analysis of BM P7mix/SP6 confirmed identity of the 3' region to the nucleotide sequence 
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from #1082 to #1402 of Fig. 3, however the 5' region contained one nucleotide misincorporation. 
[0097] Amplification of the nucleotide sequence (#97 to #1402 of Fig. 3) utilizing PEH7-9 as a template is repeated 
as described above. The resulting amplified DNA product of this procedure is digested with the restriction endonucle- 
ases Sail and Pstl. This digestion results in the excision of a 747 nucleotide fragment comprising nucleotide #97 to 
s #833 of Fig. 3 plus the additional sequences of the 5* priming oligonucleotide used to create the Sail restriction endo- 
nuclease recognition site described earlier. This 747 Sall-Pstl fragment is subcloned into a Sall-Pstl digested pSP65 
[Promega Biotech, Madison, Wl] vector resulting in 5'BMP7/SP65. DNA sequence analysis demonstrates that the insert 
of the 5'BMP7/SP65#1 comprises a sequence identical to nucleotide #97 to #362 of Fig. 3. 

[0098] The clones BMP7mix/SP6 and 5BMP7/SP65 are digested with the restriction endonucleases Sail and Ncol. 
io The resulting 3' Ncol-Sall fragment of BMP7mix/SP6 comprising nucleotides #363 to #1402 of Fig. 3 and 5 1 Sall-Ncol 
fragment of 5'BMP7/SP65 comprising nucleotides #97 to #362 of Fig. 3 are ligated together at the Ncol restriction sites 
to produce a 1317 nucleotide fragment comprising nucleotides #97 to #1402 of Fig. 3 plus the additional sequences 
derived from the 5* and 3* oligonucleotide primers which allows the creation of Sail restriction sites at both ends of this 
fragment. 

15 [0099] This 131 7 nucleotide Sail fragment is ligated nto the Sail site of the pMT2 derivative pMT2Cla-2. pMT2Cla- 
2 is constructed by digesting pMT21 with EcoRV and Xhol, treating the digested DNA with Klenow fragment of DNA 
polymerase I and ligating Clal linkers (NEBio Labs, CATCGATG). This removes bases 21 71 to 2420 starting from the 
Hindlll site near the SV40 origin of replication and enhancer sequences of pMT2 and introduces a unique Clal site, 
but leaves the adenovirus VAI gene intact, resulting in pMT2Cla-2. This clone is designated BMP-7-pMT2. 

20 [0100] The insert of BMP-7-pMT2 is excised by digestion with the restriction endonuclease Sail. The resulting 131 7 
nucleotide Sail fragment is subcloned into the Xhol restriction endonuclease site of pMT21 to yield the clone BMP- 
7/pMT21 . This Sail fragment is also subcloned into the Sail site of the pED4 vector in which the Pstl site was converted 
into a Sail site as described above, resulting in the vector pBMP7/EMC#4. 

25 F. BMP-8 Vectors 

[0101] At present no mammalian BMP-8 vectors have been constructed. However, using the sequence of Figure 6 
(SEQ ID NO: 11), it is contemplated that vectors similar to those described above for the other BMPs may be readily 
constructed. A bacterial expression vector similar to the BMP-2 vector described in detail in Example 7 may also be 
30 constructed for BMP-8, by introducing a Met before the amino acid #284 Ala of Fig. 6. This sequence of BMP-8 is 
inserted into the vector pALBP2-781 in place of the BMP-2 sequence. See Example 7. 

G. BMP Vectors Containing the Adenosine Deaminase (Ada) Marker 

35 [0102] BMP genes were inserted into the vector pMT3SV2Ada [R. J. Kaufman, Meth. Enz. , 185:537-566 (1990)] to 
yield expression plasmids containing separate transcription units for the BMP cDNA gene and the selectable marker 
Ada. pMT3SV2Ada contains a polylinker with recognition sites for the enzymes Pstl, EcoRI, Sail and Xbal that can be 
used for insertion of and expression of genes (i.e. BMP) in mammalian cells. In addition, the vector contains a second 
transcription unit encoding Ada which serves as a dominant and amplifiable marker in mammalian cells. 

40 [0103] To construct expression vectors for BMP-5, BMP-6 and BMP-7, individually, the same general method was 
employed. The gene for BMP 5 (Fig. 5), 6 (Fig. 4) or 7 (Fig. 3) was inserted into the polylinker essentially as described 
above for the pED4 vector. These vectors can be used fortransfection into CHO DUKX cells and subsequent selection 
and amplification using the Ada marker as previously described [Kaufman et al, Proc. Natl Acad. Sci. USA , 83 : 
3136-3140 (1986)]. Since each such vector does not contain a DHFR gene, the resultant transformed cells remain 

45 DHFR negative and can be subsequently transfected with a second vector containing a different BMP in conjunction 
with DHFR and amplified with methotrexate. 

[0104] Alternatively, the pMT3SV2Ada/BMP vectors can be used to transfect stable CHO cell lines previously trans- 
fected with a different BMP gene and amplified using the DHFR/methotrexate system. The resultant transfectants can 
be subsequently amplified using the Ada system, yielding cell lines that coexpress two different BMP genes, and are 
so amplified using both the DHFR and Ada markers. 

H. BMP-Expressing Mammalian Cell Lines 

[0105] At present, the most desirable mammalian cell lines for use in producing the recombinant homodimers and 
55 heterodimers of this invention are the following. These cell lines were prepared by conventional transformation of CHO 
cells using vectors described above. 

[0106] The BMP-2 expressing ceil line 2EG5 is a CHO cell stably transformed with the vector pBMP2delta-EMC. 
[0107] The BMP-4 expressing cell line 4E9 is a CHO cell stably transformed with the vector pBMP4delta-EMC. 
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[0108] The BMP-5 expressing cell line 5E1 0 is a CHO cell stably transformed with the vector BMP5mix-EMC-1 1 (at 
a amplification level of 2 micromolar MTX). 

[0109] The BMP-6 expressing cell line 6HG8 is a CHO cell stably transformed with the vector BMP6/EMC. 
[0110] The BMP-7 expressing cell line 7MB9 is a CHO cell stably transformed with the vector BMP7/pMT21 . 

5 

EXAMPLE 2 - TRANSIENT EXPRESSION OF BMP HETERODIMERS 

[0111] The heterodimers of the present invention maybe prepared by co -express ion in a transient expression system 
for screening in the assays of Example 8 by two different techniques as follows. 

io [0112] In the first procedure, the pMT2-derived and EMC-derived expression plasmids described in Example 1 and 
other similarly derived vectors were constructed which encoded, individually, BMP-2 through BMP-7, and transforming 
growth factor-beta (TGFf}1). All combinations of pairs of plasmids were mixed in equal proportion and used to co- 
transfect CHO cells using the DEAE-dextran procedure [Sompayrac and Danna, Proc. Natl. Acad. Sci. USA, 78 : 
7575-7578 (1981); Luthman and Magnusson, Nucl. Acids Res., 11:1295-1308 (1983)]. The cells are grown in alpha 

15 Minimal Essential Medium (a-MEM) supplemented with 10% fetal bovine serum, adenosine, deoxyadenosine, thymi- 
dine (100 (ig/ml each), pen/strep, and glutamine (1 mM). 

[0113] The addition of compounds such as heparin, suramin and dextran sulfate are desirable in growth medium to 
increase the amounts of BMP-2 present in the conditioned medium of CHO cells. Similarly responsive to such com- 
pounds is BMP-5. Therefore, it is expected that these compounds will be added to growth medium for any heterodimer 
20 containing these BMP components. Other BMPs may also be responsive to the effects of these compounds, which 
are believed to inhibit the interaction of the mature BMP molecules with the cell surface. 

[0114] The following day, fresh growth medium, with or without 1 00 u.g/ml heparin, was added. Twenty-four hours 
later, conditioned medium was harvested. 

[0115] In some experiments, the conditioned medium was collected minus heparin for the 24-48 hour period post- 
25 transfection, and the same plates were then used to generate conditioned medium in the presence of heparin 48-72 
hour post-transfection. Controls included transfecting cells with expression plasmids lacking any BMP sequences, 
transfecting cells with plasmids containing sequences for only a single BMP, or mixing conditioned medium from cells 
transfected with a single BMP with conditioned medium from cells transfected with a different BMP. 
[0116] Characterizations of the coexpressed heterodimer BMPs in crude conditioned media, which is otherwise not 
30 purified, provided the following results. Transiently coexpressed BMP was assayed for induction of alkaline phos- 
phatase activity on W20 stromal cells, as described in Example 8. 

[0117] Co-expression of BMP-2 with BMP-5, BMP-6 and BMP-7, and BMP-4 with BMP-5, BMP-6 and BMP-7 yielded 
more alkaline phosphatase inducing activity iri the W20 assay than either of the individual BMP homodimers alone or 
mixtures of homodimers, as shown below. Maximal activity (in vitro ), was obtained when BMP-2 was coexpressed with 
35 BMP-7. Increased activity was also found for the heterodimers BMP-2/5; BMP-2/6; BMP-4/5; BMP-4/6; and BMP-4/7. 
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(continued) 
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Units: 1 unit of activity is equivalent to that of 1 ng/ml of rhBMP-2. 
-: indicates activity below the detection limit of the assay. 



[0118] These BMP combinations were subsequently expressed using various ratios of expression plasmids (9:1 , 3: 
1 , 1 :1 , 1 :3 ( 1 :9) during the CHO cell transient transfection. The performance of this method using plasmids containing 
BMP-2 and plasmids containing BMP-7 at plasmid number ratios ranging from 9:1 to 1:9, respectively, demonstrated 
that the highest activity in the W20 assay was obtained when approximately the same number of plasmids of each 
BMP were transfected into the host cell. Ratios of BMP-2 to BMP-7 plasmids of 3:1 to 1 :3 ( respectively, also resulted 
in increased activity in W20 assay in comparison to host cells transfected with plasmids containing only a single BMP. 
However, these latter ratios produced less activity than the 1:1 ratio. 

[0119] Similar ratios may be determined by one of skill in the art for heterodimers consisting of other than BMP-2 
and BMP-7. For example, preliminary work on the heterodimer formed between BMP-2 and BMP-6 has indicated that 
a preferred ratio of plasmids forco-transfection is 3:1 , respectively. The determination of preferred ratios for this method 
is within the skill of the art. 

[0120] As an alternative means to transiently generate coexpressed BMPs, the stable CHO cell lines identified in 
Example 1 expressing each BMP-2, BMP-4, BMP-5, BMP-6 and BMP-7, are cocultured for one day, and are then fused 
with 46.7% polyethylene glycol (PEG). One day post-fusion, fresh medium is added and the heterodimers are harvested 
24 hours later for the W20 assay, described in Example 8. The assay results were substantially similar to those de- 
scribed immediately above. 

[0121] Therefore, all combinations of BMP-2 or 4 coexpressed with either BMP-5, 6 or 7 yielded greater activity than 
any of the BMP homodimers alone. In control experiments where each BMP homodimer was expressed alone and 
conditioned media mixed post harvest, the activity was always intermediate between the individual BMPs, demonstrat- 
ing that the BMP co-expressed heterodimers yield higher activity than combinations of the individually expressed BMP 
homodimers. 

EXAMPLE 3 - STABLE EXPRESSION OF BMP HETERODIMERS 
A. BMP-2/7 

[0122] Based on the results of the transient assays in Example 2, stable cell lines were made that co-express BMP- 
2 and BMP-7. 

[0123] A preferred stable cell line, 2E7E-10, was obtained as follows: Plasmid DNA (a 1 :1 mixture of pBMP-7-EMC 
and pBMP-2-EMC, described in Example 1) is transfected into CHO cells by electro p oration [Neuman et al, EMBO J. , 
1:841-845 (1982)]. 

[01 24] Two days later, cells are switched to selective medium containing 1 0% dialyzed fetal bovine serum and lacking 
nucleosides. Colonies expressing DHFR are counted 10-14 days later. Individual colonies or pools of colonies are 
expanded and analyzed for expression of each heterodimer BMP component RNA and protein using standard proce- 
dures and are subsequently selected for amplification by growth in increasing concentrations of MTX. Stepwise selec- 
tion of the preferred clone, termed 2E7E, is carried out up to a concentration of 0.5 jiM MTX. The cell line is then 
subcloned and assayed for heterodimer 2/7 expression. 

[0125] Procedures for such assay include Western blot analysis to detect the presence of the component DNA, 
protein analysis and SDS-PAGE analysis of metabolically labelled protein, W20 assay, and analysis for cartilage and/ 
or bone formation activity using the ectopic rat bone formation assay of Example 9. The presently preferred clonally- 
derived cell line is identified as 2E7E-1 0. This cell line secretes BMP-2/7 heterodimer proteins into the media containing 
0.5 u.M MTX. 

[0126] The CHO cell line 2E7E-10 is grown in Dulbecco's modified Eagle's medium (DMEM)/Ham's nutrient mixture 
F-12, 1:1 (vol/vol), supplemented with 10% fetal bovine serum. When the cells are 80 to 100% confluent, the medium 
is replaced with serum-free DMEM/F-12. Medium is harvested every 24 hours for 4 days. For protein production and 
purification the cells are cultured serum-free. 
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[0127] While the co-expressing cell line 2E7E-10 preliminarily appears to make lower amounts of BMP protein than 
the BMP2-expressing cell line 2EG5 described in Example 2, preliminary evidence suggests that the specific activity 
of the presumptive heterodimer is at least 5-fold greater than BMP-2 homodimer (see Example 6). 
[0128] To construct another heterodimer producing cell line, the stable CHO cell line 7MB9, previously transfected 

s with pBMP-7-pMT2, and which expresses BMP-7, is employed. 7MB9 may be amplified and selected to 2 u.M meth- 
otrexate resistance using the DHFR/MTX system. To generate a stable co-expressing cell line, cell line 7MB9 is trans- 
fected with the expression vector pBMP-2A-EN (EMC-Neo) containing BMP-2 and the neomycin resistance gene from 
the Tn5 transposable element. The resulting transfected stable cell line was selected for both G-418 and MTX resist- 
ance. Individual clones were picked and analyzed for BMP expression, as described above. 

10 [01 29] It is anticipated that stable cell lines co-expressing other combinations of BM Ps which show enhanced activity 
by transient coexpression will likewise yield greater activity upon stable expression. 

B. BMP-2/6 

*s [0130] Based on the results of the transient assays in Example 2, stable cell lines were made that co-express BMP- 
2 and BMP-6. 

[0131] A preferred stable cell line, 12C07, was obtained as follows: Plasmid DNA (a 1 :3 mixture of pBMP-6-EMC 
and pBMP-2-EMC, described in Example 1) is transfected into CHO cells by elect rop oration [Neuman et al, EMBO J. , 
1:841-845 (1982)]. 

20 [01 32] Two days later, cells are switched to selective medium containing 1 0% dialyzed fetal bovine serum and lacking 
nucleosides. Colonies expressing DHFR are counted 10-14 days later. Individual colonies or pools of colonies are 
expanded and analyzed for expression of each heterodimer BMP component RNA and protein using standard proce- 
dures and are subsequently selected for amplification by growth in increasing concentrations of MTX. Stepwise selec- 
tion of the preferred clone, termed 12-C, is carried out up to a concentration of 2.0 u.M MTX. The cell line is then 

25 subcloned and assayed for heterodimer 2/6 expression. 

[0133] Procedures for such assay include Western blot analysis to detect the presence of the component DNA, 
protein analysis and SDS-PAGE analysis of metabolically labelled protein, W20 assay, and analysis for cartilage and/ 
or bone formation activity using the ectopic rat bone formation assay of Example 9. The presently preferred clonally- 
derived cell line is identified as 12C07. This cell line secretes BMP-2/6 heterodimer proteins into the media containing 

30 2.0 (J.M MTX. 

[0134] The CHO cell line 12C07 is grown in Dulbecco's modified Eagle's medium (DMEM)/Ham's nutrient mixture 
F-12, 1:1 (vol/vol), supplemented with 10% fetal bovine serum. When the cells are 80 to 100% confluent, the medium 
is replaced with serum-free DMEM/F-12, Medium is harvested every 24 hours for 4 days. For protein production and 
purification the cells are cultured serum-free. 

35 [0135] While the co-expressing cell line 12C07 preliminarily appears to make lower amounts of BMP protein than 
the BMP2-expressing cell line 2EG5 described in Example 2, preliminary evidence suggests that the specific activity 
of the presumptive heterodimer is at least 3-5-fold greater than BMP-2 homodimer (see Example 6). 
[0136] To construct another heterodimer producing cell line, the stable CHO cell line 2EG5, previously transfected 
with pBMP-2-EMC, and which expresses BMP-2, is employed. 2EG5 may be amplified and selected to 2 u,M meth- 

40 otrexate resistance using the DHFR/MTX system. To generate a stable co-expressing cell line, cell line 2EG5 is trans- 
fected with the expression vector pBMP-6-ada (ada deaminase) containing BMP-6 and the ADA resistance gene. The 
resulting transfected stable cell line was selected for both DCF and MTX resistance. Individual clones are picked and 
analyzed for BMP expression, as described above. 

[0137] It is anticipated that stable cell lines co-expressing other combinations of BMPs which show enhanced activity 
45 by transient coexpression will likewise yield greater activity upon stable expression. 

EXAMPLE 4-PURIFICATION OF BMP2/7 AND BMP-2/6 HETERODIMER 

[0138] The same purification procedure is used for BMP-2/6 heterodimer and BMP-2/7 heterodimer. Conditioned 
so media from cultures of cell line 2E7E-1 0 or 12C07 containing recombinantly produced BMP heterodimer 2/7V or 2/6, 
respectively, can be generated from either adherent or suspension cultures. For small to medium scale generation of 
coexpressed BMP, adherent cultures are seeded into roller bottles and allowed to grow to confluence in alpha-Minimal 
Eagles Medium [a-MEM, Gibco, Grand Island, NY] containing 10% dialyzed heat-inactivated fetal calf serum [Hazleton, 
Denver, PA]. The media is then switched to a serum-free, albumin free, low protein medium based on a 50:50 mixture 
55 of Delbecco's Modified Eagle's medium and Hams F-12 medium, optionally supplemented with 100 micrograms/ml 
dextran sulfate. Four or five daily harvests are pooled, and used to purify the recombinant protein. 
[0139] Conditioned medium from roller bottle cultures obtained as described above was thawed slowly at room tem- 
perature and pooled. The pH of the pooled medium was adjusted to pH 8.0 using 1 M Tris, pH 8.0. A column was 
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poured containing Matrex Cellufine Sulfate [Amicon] and equilibrated in 50 mM Tris, pH 8.0. 

[0140] Upon completion of loading of the medium, the column was washed with buffer containing 50 mM Tris, 0.4 
M NaCI, pH 8.0 until the absorbance at 280 nm reached baseline. The column was then washed with 50 mM Tris, pH 
8.0 to remove NaCI from the buffer. The resin was then washed with 50 mM Tris, 0.2 M NaCI, 4 M Urea, pH 8.0 until 

5 a peak had eluted. The column was then washed into 50 mM Tris, pH 8.0 to remove the urea. 

[0141] The bound BMP-2/7 or BMP-2/6 was then eluted using 50 mM Tris, 0.5 M NaCI, 0.5 M Arginine, pH 8.0. The 
eiuate was collected as a single pool and may be optionally stored frozen prior to further purification. This Cellufine 
Sulfate eiuate was diluted with 14 volumes of 6M urea and the pH of the sample was then adjusted to 6.0. A hydroxya- 
patite-Ultrogei [IBF] column was poured and equilibrated with 80 mM potassium phosphate, 6M urea, pH 6.0. 

w [0142] After the completion of sample loading, the column was washed with 10 bed volumes of the equilibration 
buffer. Bound BMP-2/7 or BMP-2/6 heterodimers were eluted with 5 bed volumes of 100 mM potassium phosphate, 
6M urea, pH 7.4. This eiuate was loaded directly onto a Vydac C 4 reverse-phase HPLC column equilibrated in water 
- 0.1% TFA. BMP-2/7 or BMP-2/6 heterodimers were eluted with a gradient of 30-50% acetonitrile in water - 0.1% 
trifluoroacetic acid. 

15 [0143] Fractions containing BMPs are identified by SDS-PAGE in the presence or absence of reductant. The identity 
of the BMPs with respect to the heterodimers vs. homodimers is determined by 2D-PAGE (+/- reductant). Fractions 
with heterodimers gave bands which reduce to two spots. Bands from homodimer fractions reduce to a single spot for 
each BMP species. 

[0144] The BMP-2/6 heterodimer subunits are analyzed on a protein sequenator. BMP-2/6 heterodimers of the fol- 
20 lowig species are present: BMP-6 subunit beginning with amino acid #375 Ser-Ala-Ser-Ser in association with BMP- 
2 subunit beginning with amino acid #283 Gin-Ala-Lys or #249 Ser-Lev-His, though other less abundant species may 
be present. 

It is contemplated that the same or substantially similar purification techniques may be employed for any recombinant 
BMP heterodimer of this invention. The hydroxyapatite-Ultrogel column may be unnecessary and that the purification 
25 scheme may be modified by loading the Cellufine Sulfate eiuate directly onto the C 4 reverse-phase HPLC column 
without use of the former column for BMP2/7 or BMP-2/6 or the other heterodimers of this invention. 

EXAMPLE 5 - PROTEIN CHARACTERIZATION 

30 [0145] Total protein secreted from the co-expressing cell lines is analyzed after labelling with 35 S-methionine or by 
Western blot analysis using antibodies raised against both BMPs of the heterodimer, e.g., BMP-2 and BMP-7. Together 
with the alkaline phosphatase assays, the data indicates the presence of the heterodimer and the specific activity. The 
following specific details are directed towards data collected for the BMP-2/7 and BMP-2/6 heterodimers; however, by 
application of similar methods to the other heterodimers described herein, similar results are expected. 

35 

A. 35 S-Met labelling 

[0146] Cell lines derived by cotransfection of BMP2A-EMC and BMP7A-EMC expression vectors were pulsed 
with 35 S-methioninefor 15 minutes, and chased for 6 hours in serum free media in the presence or absence of heparin. 
40 Total secreted protein was analyzed under reducing conditions by PAGE and fluorography. The results demonstrate 
that several cell lines secrete both BMP-2 and BMP-7 protein. There is a good correlation between the amount of 
alkaline phosphatase activity and the amount of coexpressed protein. 

[0147] Several cell lines secrete less total BMP-2 and 7 than the BMP-2-only expressing cell line 2EG5, which pro- 
duces 10 ^ig/ml BMP-2. Cell line 2E7E-10 (amplified at a level of 0.5mM MTX) secretes equal proportions of BMP-2 
and BMP-7 at about the same overall level of expression as the cell line 2EG5. Cell line 2E7E-10 produces the equiv- 
alent of 600 micrograms/ml of BMP-2 homodimer activity in one assay. 

[0148] Total labelled protein was also analyzed on a two-dimensional non-reducing/reducing gel system to ascertain 
whether a heterodimer is made. Preliminary results demonstrate the presence of a unique spot in this gel system that 
is not found in either the BMP-2-only or BMP-7-only cell lines, suggesting the presence of 2/7 heterodimer. The same 
so gel with purified material produced the same results (e.g., two unique spots on the gel) indicative of the presence of 
the 2/7 heterodimer. The homodimer of BMP2 produced distinct species on this gel system. 

[0149] In contrast to the recombinant BMP-2/7 purification, BMP-2 homodimers are not detected during the BMP- 
2/6 preparation; however, significant amounts of BMP-6 homodimers are found. In addition, a significant amount of a 
-20 amino acid N-terminal truncated form of BMP-6 is found; this could be eliminated by the inclusion of protease 
55 inhibitors during cell culture. BMP-2/6 was found to elute two to three fractions later from C4 RP-HPLC than did BMP- 
2/7. 

[0150] Amino acid sequencing indicates that the predominant BMP-2/7 heterodimer species comprises a mature 
BMP-2 subunit [amino acid #283(Gln)-#396(Arg)] and a mature subunit of BMP-7 [#293(Ser)-#431(His)J. BMP-2/6 
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heterodimer comprises the mature BMP-2 subunit (#283-396) and the mature BMP-6 subunit [#375{Ser)-#513(His)]. 
B. Immunoprecipitation coupled to Western blot analysis 

5 [0151] Conditioned media from a BMP-2-only (2EG5), a BMP-7-only (7MB9), or the 2E7E-10 co-expressing cell line 
were subjected to immunoprecipitation with either a BMP-2 or BMP-7 antibody (both conventional polyclonal antibodies 
raised in rabbits), then analyzed on Western blots probed with either an anti-BMP-2 or anti-BMP-7 antibody. The 2/7 
heterodimer precipitates and is reactive on Western blots with both the BMP-2 and BMP-7 antibodies, while either 
BMP by itself reacts with its specific antibody, but not with the reciprocal antibody 

10 [0152] It has been demonstrated using this strategy that a protein in the co-expressing cell line that is precipitated 
by the anti-BMP-7 antibody W33 [Genetics Institute, Inc, Cambridge, Massachusetts] and reacts on a Western blot 
with the anti-BMP-2 antibody W12 or W10 [Genetics Institute, Inc.] is not present in the BMP-2 or 7-only expressing 
cell lines. This experiment indicates that this protein species is the heterodimeric protein. Conversely, precipitation with 
W12 and probing with W33 yielded similar results. 

15 

EXAMPLE 6 - SPECIFIC ACTIVITY OF HETERODIMERS 

A. In vitro Assays 

20 [0153] The specific activity of the BMP-2/7 or BMP-2/6 heterodimer and the BMP-2 homodimer secreted into growth 
medium of the stable cell lines 2E7E-10 and 2EG55, and 12C07 and 2EG5, respectively, were estimated as follows. 
[0154] The amount of BMP protein in conditioned medium was measured by either Western blot analysis or by 
analyzing protein secreted from 35 S-methionine labelled cells by PAGE and fluorography. The amount of activity pro- 
duced by the same cell lines on W20 cells using either the alkaline phosphatase assay or osteocalcin-induction assay 

25 was then estimated. The specific activity of the BMP was calculated from the ratio of activity to protein secreted into 
the growth medium. 

[0155] In one experiment 2E7E-10 and 2EG5 secreted similar amounts of total BMP proteins as determined by PAGE 
and fluorography. 2E7E-1 0 produced about 50-fold more alkaline phosphatase inducing activity than 2EG5, suggesting 
that the specific activity of the heterodimer is about 50-fold higher than the homodimer. 
30 [0156] In another experiment the amount of BMP-2 secreted by 2EG5 was about 50% higher than BMP-2/7 secreted 
by 2E7E-10, however, 2E7E-10 produced about 1 0-fold more osteocalcin-inducing activity that 2EG5. From several 
different experiments of this type the specific activity of the BMP-2/7 heterodimer is estimated to be between 5 to 50 
fold higher than the BMP-2 homodimer. 

[0157] Figures 8 and 9 compare the activity of BMP-2 and BMP-2/7 in the W20 alkaline phosphatase and BGP (Bone 
35 Gfa Protein, osteocalcin) assays. BMP-2/7 has greatly increased specific activity relative to BMP-2 (Figure 8). From 
Figure 8, approximately 1 .3 ng/ml of BMP-2/7 was sufficient to induce 50% of the maximal alkaline phosphatase re- 
sponse in W-20 cells. A comparable value for BMP-2 is difficult to calculate, since the alkaline phosphatase response 
did not maximize, but greater than 30 ng/ml is needed for a half-maximal response. BMP-2/7 thus has a 20 to 30-fold 
higher specific activity than BMP-2 in the W-20 assay. 
40 [0158] As seen in Figure 9, BMP-2/7 was also a more effective stimulator of BGP (bone gla protein, osteocalcin) 
production than BMP-2 in this experiment. Treating W-20-17 celts with BMP-2/7 for four days resulted in a maximal 
BGP response with 62 ng/ml, and 1 1 ng/ml elicits 50% of the maximal BGP response. In contrast, maximal stimulation 
of BGP synthesis by BMP-2 was not seen with doses up to 468 ng/ml of protein. The minimal dose of BMP-2/7 needed 
to elicit a BGP response by W-20-17 cells was 3.9 ng/ml, about seven-fold less than the 29 ng/ml required of BMP-2. 
45 These results were consistent with the data obtained in the W-20-1 7 alkaline phosphatase assays for BMP-2 and BMP- 
2/7. 

[0159] Preliminary analysis indicates that BMP-2/6 has a specific activity in vitro similar to that of BMP-2/7. The 
potencies of BMP-2 and BMP-2/6 on induction of alkaline phosphatase production in W-20 is compared, as shown in 
Figure 12, BMP-2/6 has a higher specific activity than BMP-2 in this assay system. This data is in good agreement 
so with data obtained from the in vivo assay of BMP-2 and BMP-2/6). 

B. In Vivo Assay 
(i) BMP-2/7 

55 

[0160] The purified BMP-2/7 and BMP-2 were tested in the rat ectopic bone formation assay. A series of different 
amounts of BMP-2/7 or BMP-2 were implanted in triplicate in rats. After 5 and 10 days, the implants were removed 
and examined histologically for the presence of bone and cartilage. The histological scores for the amounts of new 
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cartilage and bone formed are summarized in Table A. 



Table A 
5 Day Implants 10 Day Implants 



BMP-2 


BMP-2/7 


BMP-2 




± - ± 
± - ± 
2 1 2 


- ± ± 




1 ± 1 


- ± - 


± ± ± 


2 2 2 


1 1 ± 




2 3 3 


1 1 ± 


1 ± 1 


1 1 2 


12 1 




4 4 3 


2 3 2 




± ±2 


2 2 2 




4 4 3 


3 3 3 



BMP-2/7 



0.04 C ± - ± 

B 

0.02 C ± 1 ± 

B 

1.0 C 1 ± ± 

B 

5.0 C 2 2 1 

B ± - 1 
25.0 C 

B 



[0161] The amount of BMP-2/7 required to induce cartilage and bone in the rat ectopic assay is lower than that of 
BMP-2. Histologically, the appearance of cartilage and bone induced by BMP-2/7 and BMP-2 are identical. 

(ii) BMP-2/6 

[0162] The in vivo activity of BMP-2/6 was compared with that of BMP-2 by implantation of various amounts of each 
BMP for ten days in the rat ectopic bone formation assay. The results of this study (Table B, Figure 13) indicate that 
BMP-2/6, similar to BMP-2/7, has increased in vivo activity relative to BMP-2. The specific activities of BMP-2, BMP- 
6, and BMP-2/6 are compared in the ectopic bone formation assay ten days after the proteins are implanted. The 
results of these experiments are shown in Table C and Figure 14. BMP-2/6 is a more potent inducer of bone formation 
than either BMP-2 or BMP-6. The amount of bone formation observed with BMP-2/6 was comparable to that observed 
with equivalent doses of BMP-2/7. The appearance of BMP-2/6 implants is quite similar to implants containing BMP- 
2 or BMP-2/7. 



Table B 



Histological scores of Implants of BMP 2/6 and BMP-2 In rat ectopic assay (10 day 
implants). 


BMP (u.g) 


C/B 


BMP-2/6 


BMP-2 


0.04 


C 


- ± - 






B 






0.20 


C 


1 1 ± 






B 


± ±± 




1.0 


C 


1 33 


1 1 ± 




B 


1 22 


1 1 ± 


5.0 


C 


222 


1 2 2 




B 


233 


222 
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Table B 


(continued) 




Histological scores of Implants of BMP 2/6 and BMP-2 In rat ectopic assay (10 day 


implants). 








BMP fog) 


C/B 


BMP-2/6 


BMP-2 


25. 


C 


1 1 1 


2 2 1 




B 


3 3 3 


3 3 3 



10 



15 



20 



25 



Table C 



Histological scores of implants of BMP-2, BMP-6, and BMP-2/6 in rat ectopic assay (10 day 
implants). 


BMP (M.g) 


C/B 


BMP-2 


BMP-6 


BMP-2/6 


0.04 


C 






- -± 




B 






- - + 


0.20 


C 


--2 




1 22 




B 


- - 1 




222 


1.0 


C 


- ± ± 


2 1 1 


1 1 1 




B 


- ± ± 


1 ± ± 


332 


5.0 


C 


22 1 


3 1 3 


± ± 1 




B 


1 1 1 


2± 1 


454 


25. 


C 


± ± ± 


± ± ± 


± ± ± 




B 


545 


445 


453 



EXAMPLE 7 - EXPRESSION OF BMP DIMEB IN E. COLI 



35 



[0163] A biologically active, homodimeric BMP-2 was expressed in E. coli using the techniques described in Euro- 
pean Patent Application 433,255 with minor modifications. Other methods disclosed in the above-referenced European 
patent application may also be employed to.produce heterodimers of the present invention from E. coli . Application of 
these methods to the heterodimers of this invention is anticipated to produce active BMP heterodimeric proteins from 
E. coli. 



40 



A. BMP-2 Expression Vector 



50 



55 



[0164] An expression plasmid pALBP2-781 (Figure 7) (SEQ ID NO: 1 3) was constructed containing the mature por- 
tion of the BMP-2 (SEQ ID NO: 14) gene and other sequences which are described in detail below. This plasmid 
directed the accumulation of 5-10% of the total cell protein as BMP-2 in an E. coli host strain, GI724, described below. 
[0165] Plasmid pALBP2-781 contains the following principal features. Nucleotides 1 -2060 contain DNA sequences 
originating from the plasmid pUC-1 8 [Norrander et al, Gene , 26:1 01 -1 06 (1 983)] including sequences containing the 
gene for p-lactamase which confers resistance to the antibiotic ampicillin in host E. coli strains, and a colEI-derived 
origin of replication. Nucleotides 2061-2221 contain DNA sequences for the major leftward promoter (pL) of bacteri- 
ophage X [Sanger et al, J. Mol. Biol. , 1 62 :729-773 (1982)], including three operator sequences, O l 1 , O l 2 and O l 3. 
The operators are the binding sites for Xcl repressor protein, intracellular levels of which control the amount of tran- 
scription initiation from pL. Nucleotides 2222-2723 contain a strong ribosome binding sequence included on a sequence 
derived from nucleotides 35566 to 35472 and 381 37 to 38361 from bacteriophage lambda as described in Sanger et 
al, J. Mol. Biol. , 162:729-773 (1982), Nucleotides 2724-3133 contain a DNA sequence encoding mature BMP-2 protein 
with an additional 62 nucleotides of 3'-untranslated sequence. 

[0166] Nucleotides 3134-3149 provide a "Linker" DNA sequence containing restriction endonuclease sites. Nucle- 
otides 3150-3218 provide a transcription termination sequence based on that of the E. coli asp A gene [Takagi et al, 
Nucl. Acids Res. , 13:2063-2074 (1985)]. Nucleotides 3219-3623 are DNA sequences derived from pUC-18. 
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[0167] As described below, when cultured under the appropriate conditions in a suitable E^ coli host strain, 
pALBP2-781 can direct the production of high levels (approximately 10% of the total cellular protein) of BMP-2 protein. 
[01 68] pALBP2-781 was transformed into the E. coli host strain GI724 (F, lacH, lacP L8 , ampC: :Xcl + ) by the procedu re 
of Dagert and Ehrtich, Gene , 6:23 (1979). [The untransformed host strain E. coli G 1 724 was deposited with the American 
Type Culture Collection, 12301 Parklawn Drive, Rockvilie, Maryland on January 31, 1991 under ATCC No. 55151 for 
patent purposes pursuant to applicable laws and regulations.] Transformants were selected on 1 .5% w/v agar plates 
containing IMC medium, which is composed of M9 medium [Miller, "Experiments in Molecular Genetics", Cold Spring 
Harbor Laboratory, New York (1972)] supplemented with 0.5% w/v glucose, 0.2% w/v casamino acids and 100 u.g/ml 
ampicillin. 

[01 69] GI724 contains a copy of the wild-type Xcl repressor gene stably integrated into the chromosome at the amp C 
locus, where it has been placed under the transcriptional control of Salmonella typhimurium trp promoter/operator 
sequences. In GI724, Acl protein is made only during growth in tryptophan-free media, such as minimal media or a 
minimal medium supplemented with casamino acids such as IMC, described above. Addition of tryptophan to a culture 
of GI724 will repress the tno promoter and turn off synthesis of A.cl, gradually causing the induction of transcription from 
pL promoters if they are present in the cell. 

[0170] GI724 transformed with pALBP2-781 was grown at 37°C to an A 550 of 0.5 (Absorbence at 550 nm) in IMC 
medium. Tryptophan was added to a final concentration of 100 u,g/ml and the culture incubated for a further 4 hours. 
During this time BMP-2 protein accumulated to approximately 10% of the total cell protein, alt in the "inclusion body" 
fraction. 

[0171] BMP-2 is recovered in a non-soluble, monomeric form as follows. Cell disruption and recovery is performed 
at 4°C. Approximately 9 g of the wet fermented E. coli GI724/pALBP2-781 cells are suspended in 30 ml_ of 0.1 M Tris/ 
HCI, 10 mM EDTA, 1 mM phenyl methyl sulphonyl fluoride (PMSF), pH 8.3 (disruption buffer). The cells are passed 
four times through a cell disrupter and the volume is brought to 1 00 ml_ with the disruption buffer. The suspension is 
centrifuged for 20 min. (1 5,000 x g). The pellet obtained is suspended in 50 ml_ disruption buffer containing 1 M NaCI 
and centrifuged for 10 min. as above. The pellet is suspended in 50 mL disruption buffer containing 1% Triton X-100 
(Pierce) and again centrifuged for 10 min. as above. The washed pellet is then suspended in 25 mL of 20 mM Tris/ 
HCI, 1 mM EDTA, 1 mM PMSF, 1% DTT, pH 8.3 and homogenized in a glass homogenizer. The resulting suspension 
contains crude monomeric BMP-2 in a non-soluble form. 

[0172] Ten mL of the BMP-2 suspension, obtained as described above, are acidified with 1 0% acetic acid to pH 2.5 
and centrifuged in an Eppendorf centrifuge for 10 min. at room temperature. The supernatant is chromatographed. 
Chromatography was performed on a Sephacryl S-100 HR column (Pharmacia, 2.6 x 83 cm) in 1% acetic acid at a 
flow rate of 1.4 mL/minute. Fractions containing monomeric, BMP-2 are pooled. This material is used to generate 
biologically active, homodtmer BMP-2. 

[0173] Biologically active, homodimeric BMP-2 can be generated from the monomeric BMP-2 obtained following 
solubilization and purification, described above, as follows. 

[0174] 0.1 , 0.5 or 2.5 mg of the BMP-2 is dissolved at a concentration of 20, 1 00 or 500 u,g/mL, respectively, in 50 
mMTris/HCI.pHS.O, 1 M NaCI, 5mM EDTA, 2mM reduced glutathione, 1 mM oxidized glutathione and 33 mM CHAPS 
[Calbiochem]. After 4 days at 4°C or 23°C, the mixture is diluted 5 to 10 fold with 0.1% TFA. 

[0175] Purification of biologically active BMP-2 is achieved by subjecting the diluted mixture to reverse phase HPLC 
on a a Vydac C4 214TP54 column (25 x .46 cm) [The NEST Group, USA] at a flow rate of 1 ml/minute. Buffer A is 
0.1% TFA. Buffer B is 90% acetonitrile, and 0.1% TFA. The linear gradient was 0 to 5 minutes at 20% Buffer B; 5 to 
10 minutes at 20 to 30 % Buffer B; 1 0 to 40 minutes at 30 to 60% Buffer B; and 40 to 50 minutes at 60 to 100% Buffer 
B. Homodimeric BMP-2 is eluted and collected from the HPLC column. 

[0176] The HPLC fractions are lyophilized to dryness, redissolved in sample buffer (1 .5 M Tris-HCI, pH 8.45, 12% 
glycerol, 4% SDS, .0075% Serva Blue G, .0025% Phenol Red, with or without 100 mM dithiothreitol) and heated for 
five minutes at 95°C. The running buffer is 100 mM Tris, 100 mM tricine (16% tricine gel) [Novex], 0.1% SDS at pH 
8.3. The SDS-PAGE gel is run at 125 volts for 2.5 hours. 

[0177] The gel is stained for one hour with 200 ml of 0.5% Coomassie Brilliant Blue R-250, 25% isopropanol, 10% 
acetic acid, heated to 60°C. The gel is then destained with 10% acetic acid, 10% isopropanol until the background is 
clear. 

[0178] The reduced material ran at approximately 13kD; the non-reduced material ran at approximately 30 kD, which 
is indicative of the BMP-2 dimer. This material was later active in the W20 assay of Example 8. 

B. BMP-7 Expression Vector 

[0179] For high level expression of BMP-7 a plasmid pALBMP7-981 was constructed. pAIBMP7-981 is identical to 
plasmid pALBP2-781 with two exceptions: the BMP-2 gene (residues 2724-3133 of pALBP2-781) is replaced by the 
mature portion of the BMP-7 gene, deleted for sequenced encoding the first seven residues of the mature BMP-7 
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protein sequence: 



ATGTCTCATAATC GTTCTAAAAC TCCAAAAAAT CAAGAAGCTC TGCGTATGGC 



CAACGTGGCA 


GAGAACAGCA 


GCAGCGACCA 


GAGGCAGGCC 


TGT AAGAAGC 


ACGAGCTGTA 


TGTCAGCTTC 


CGAGACCTGG 


GCTGG CAGG A 


CTGGATCATC 


GCGCCTGAAG 


GCTACGCCGC 


CTACTACTGT 


GAGGGGGAGT 


GTGCCTTCCC 


TCTGAACTCC 


TACATGAACG 


CCACCAACCA 


CGCCATCGTG 


CAGACGCTGG 


TCCACTTCAT 


CAACCCGGAA 


ACGGTGCCCA 


AGCCCTGCTG 


TGCGCCCACG 


CAGCTCAATG 


CCATCTCCGT 


CCTCTACTTC 


GATGACAGCT 


CCAACGTCAT 


CCTGAAGAAA 


TACAGAAACA 


TGGTGGTCCG 


GGCCTQTGGC 


TGCCACTAGC 


TCCTCCGAGA 


ATTCAGACCC 


TTTGGGGCCA 


AGTTTTTCTG 


GATCCT 



and the ribosome binding site found between residues 2707 and 2723 in pALBP2-781 is replaced by a different ribos- 
ome binding site, based on that found preceding the T7 phage gene 10, of sequence 5'-CAAGAAGGAGATATACAT- 
3'. The host strain and growth conditions used for the production of BMP-7 were as described for BMP-2. 

C. BMP-3 Expression Vector 

[01 80] For high level expression of BM P-3 a plasmid pALB3-782 was constructed. This plasmid is identical to plasmid 
pALBP2-781, except that the BMP-2 gene (residues 2724-3133 of pALBP2-781) is replaced by a gene encoding a 
form of mature BMP-3. The sequence of this BMP-3 gene is: 

ATGCGTAAAC AATGGATTGA ACCACGTAAC TGTGCTCGTC GTTATCTGAA 
AGTAGACTTT GCAGATATTG GCTGG AGTG A ATGGATTATC TCCCCCAAGT 
CCTTTGATGC CTATTATTGC TCTGGAGCAT GCCAGTTCCC CATGCCAAAG 
TCTTTGAAGC CATCAAATCA TGCTACCATC CAGAGTATAG TGAGAGCTGT 
GGGGGTCGTT CCTGGGATTC CTGAGCCTTG CTGTGTACCA GAAAAGATGT 
CCTCACTCAG TATTTTATTC TTTGATGAAA AT AAG AATGT AGTGCTTAAA 
GTATACCCTA ACATGACAGT AGAGTCTTGC GCTTGCAGAT AACCTGGCAA 
AGAACTCATT TGAATGCTTA ATTCAAT 

[0181] The host strain and growth conditions used for the production of BMP-3 were as described for BMP-2. 

D. Expression of a BMP-2/7 Heterodimer in E. coli 

[01 82] Denatured and purified E. coli BM P-2 and BM P-7 monomers were isolated from E. coli inclusion body pellets 
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by acidification and gel filtration as previously as previously described above. 125 ug of each BMP in 1% acetic acid 
were mixed and taken to dryness in a speed vac. The material was resuspended in 2.5 ml 50 mM Tris ( 1 .0 NaCI, 5 
mM EDTA, 33 mM CHAPS, 2 mM glutathione (reduced), 1 mM glutathione (oxidized), pH 8.0. The sample was incubated 
at 23 C for one week. 

[0183] The BMP-2/7 heterodimer was isolated by HPLC on a 25 x 0.46 cm Vydac C4 column. The sample was 
centrifuged in a microfuge for 5 minutes, and the supernatant was diluted with 22.5 ml 0.1% TFA. 

A buffer: 0.1% TFA 

B buffer : 0.1% TFA, 95% acetonitrile 



1 .0 ml/minute 


0-5' 


20% B 


5-10' 


20-30% B 


10-90' 


30-50% B 


90-100' 


50-100% B 



By SDS-PAGE analysis, the BMP-2/7 heterodimer eluted at about 23'. 

[0184] Figure .1 0 is a comparison of the W-20 activity of E. coli BMP-2 and BMP-2/7 heterodimer, indicating greater 
activity of the heterodimer. 

R Expression of BMP-2/3 Heterodimer in E. coli 

[0185] BMP-2 and BMP-3 monomers were isolated as follows: to 1 .0 g of frozen harvested cells expressing either 
BMP-2 or BMP-3 was added 3.3 ml of 1 00 mM Tris, 1 0 mM EDTA, pH 8.3. The cells were resuspended by vortexing 
vigorously. 33 ul of 1 00 mM PMSF in isopropanol was added and the cells lysed by one pass through a French pressure 
cell. The lysate was centrifuged in a microfuge for 20 minutes at 4 C. The supernatant was discarded. The inclusion 
body pellet was taken up in 8.0 M quanidine hydrochloride, 0.25 M OTT, 0.5 M Tris, 5 mM EDTA, pH 8.5, and heated 
at 37 C for one hour. 

[0186] The reduced and denatured BMP monomers were isolated by HPLC on a SupelcoC4 guard column as follows: 

A buffer: 0.1% TFA 

B buffer : 0.1% TFA, 95% acetonitrile 



1 .0 ml/minute 


0-5' 


1% B 


5-40* 


1-70% B 


40-45* 


70-100% B 



Monomeric BMP eluted at 28-30'. Protein concentration was estimated by A280 and the appropriate extinction coeffi- 
cient. 

[0187] 10 ug of BMP-2 and BMP-3 were combined and taken to dryness in a speed vac. To this was added 50 ul of 
50 mM Tris, 1 .0 M NaCI, 5 mM EDTA, 33 mM CHAPS, 2 mM reduced glutathione, 1 mM oxidized glutathione, pH 8.5. 
The sample was incubated at 23 for 3 days. The sample was analyzed by SDS-PAGE on a 16% tricine gel under 
reducing and nonreducing conditions. The BMP-2/3 heterodimer migrated at about 35 kd nonreduced, and reduced to 
BMP-2 monomer at about 13 kd and BMP-3 monomer at about 21 kd. 

[0168] BMP-2/3 heterodimer produced in E, coli\s tested for in vivo activity. (20 jig) at (ten days) is utilized to compare 
the in vivo activity of BMP-2/3 to BMP-2. BMP-2/3 implants showed no cartilage or bone forming activity, while the 
BMP-2 control implants showed the predicted amounts of bone and cartilage formation. The in vivo data obtained with 
BMP-2/3 is consistent with the in vitro data from the W-20 assay. 

EXAMPLE 8 - W-20 BIOASSAYS 

A. Description of W-20 cells 

[0189] Use of the W-20 bone marrow stromal cells as an indicator cell line is based upon the conversion of these 
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cells to osteoblast-like cells after treatment with BMP-2 [R. S. Thies et al, "Bone Morphogenetic Protein alters W-20 
stromal cell differentiation in vitro", Journal of Bone and Mineral Research , 5(2):305 (1990); and R. S. Thies et al, 
"Recombinant Human Bone Morphogenetic Protein 2 Induces Osteoblastic Differentiation in W-20-1 7 Stromal Cells", 
Endocrinology , in press (1 992)]. Specifically, W-20 cells are a clonal bone marrow stromal cell line derived from adult 

5 mice by researchers in the laboratory of Dr. D. Nathan, Children's Hospital, Boston, MA. BMP-2 treatment of W-20 
cells results in (1 ) increased alkaline phosphatase production, (2) induction of PTH stimulated cAMP, and (3) induction 
of osteocalcin synthesis by the cells. While (1) and (2) represent characteristics associated with the osteoblast phe- 
notype, the ability to synthesize osteocalcin is a phenotypic property only displayed by mature osteoblasts. Further- 
more, to date we have observed conversion of W-20 stromal cells to osteoblast-like cells only upon treatment with 

10 BMPs. In this manner, the in vitro activities displayed by BMP treated W-20 cells correlate with the in vivo bone forming 
activity known for BMPs. 

[0190] Below two in vitro assays useful in comparison of BMP activities of novel osteoinductive molecules are de- 
scribed. 

15 8. W-20 Alkaline Phosphatase Assay Protocol 

[0191] W-20 cells are plated into 96 well tissue culture plates at a density of 10,000 cells per well in 200 u.l of media 
(DME with 10% heat inactivated fetal calf serum, 2 mM glutamine and 100 U/ml + 100 u.g/ml streptomycin. The cells 
are allowed to attach overnight in a 95% air, 5% C0 2 incubator at 37°C. 
20 [0192] The 200 uJ of media is removed from each wellwith a multichannel pipettor and replaced with an equal volume 
of test sample delivered in DME with 10% heat inactivated fetal calf serum, 2 mM glutamine and 1% penicillin-strep- 
tomycin. Test substances are assayed in triplicate. 

[0193] The test samples and standards are allowed a 24 hour incubation period with the W-20 indicator cells. After 
the 24 hours, plates are removed from the 37°C incubator and the test media are removed from the cells. 
25 [0194] The W-20 cell layers are washed 3 times with 200 uJ per well of calcium/magnesium free phosphate buffered 
saline and these washes are discarded. 

[0195] 50 jxl of glass distilled water is added to each well and the assay plates are then placed on a dry ice/ethanol 
bath for quick freezing. Once frozen, the assay plates are removed from the dry ice/ethanol bath and thawed at 37°C. 
This step is repeated 2 more times for a total of 3 freeze-thaw procedures. Once compfete, the membrane bound 
30 alkaline phosphatase is available for measurement. 

[0196] 50 uJ of assay mix (50 mM glycine, 0.05% Triton X-100, 4 mM MgCI 2 , 5 mM p-nitrophenol phosphate, pH = 
1 0.3) is added to each assay well and the assay plates are then incubated for 30 minutes at 37°C in a shaking waterbath 
at 60 oscillations per minute. 

[0197] At the end of the 30 minute incubation, the reaction is stopped by adding 100 uJ of 0.2 N NaOH to each well 
35 and placing the assay plates on ice. 

[0198] The spectrophotometric absorbance for each well is read at a wavelength of 405 nanometers. These values 
are then compared to known standards to give an estimate of the alkaline phosphatase activity in each sample. For 
example, using known amounts of p-nitrophenol phosphate, absorbance values are generated. This is shown in Table I. 



Table I 



Absorbance Values for Known Standards of P-Nitrophenol Phosphate 


P-nitrophenol phosphate umoles 


Mean absorbance (405 nm) 


0.000 


0 


0.006 


0.261 +/- .024 


0.012 


0.521 +/- .031 


0.018 


0.797 +■/- .063 


0.024 


1.074 +/- .061 


0.030 


1.305+/- .083 



[01 99] Absorbance values for known amounts of BM P-2 can be determined and converted to umoles of p-nitrophenol 
phosphate cleaved per unit time as shown in Table II. 

55 
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Table II 



Alkaline Phosphatase Values for W-20 Cells Treating with BMP-2 


BMP-2 concentration ng/ml 


Absorbance Reading 405 nmeters 


umoles substrate per hour 


0 


0.645 


0.024 


1.56 


0.696 


0.026 


3.12 


0,765 


0.029 


6.25 


0.923 


0.036 


12.50 


1.121 


0.044 


25.0 


1.457 


0.058 


50.0 


1.662 


0.067 


100.0 


1.977 


0.080 



[0200] These values are then used to compare the activities of known amounts of BMP heterodimers to BMP-2 
homodimer. 

C. Osteocalcin RIA Protocol 

[0201] W-20 cells are plated at 10 6 cells per well in 24 well multiwell tissue culture dishes in 2 mis of DME containing 
10% heat inactivated fetal calf serum, 2 mM glutamine. The cells are allowed to attach overnight in an atmosphere of 
95% air 5% C0 2 at 37°C. 

[0202] The next day the medium is changed to DME containing 1 0% fetal calf serum, 2 mM glutamine and the test 
substance in a total volume of 2 ml. Each test substance is administered to triplicate wells. The test substances are 
incubated with the W-20 cells for a total of 96 hours with replacement at 48 hours by the same test medias. 
[0203] At the end of 96 hours, 50 \l\ of the test media is removed from each well and assayed for osteocalcin pro- 
duction using a radioimmunoassay for mouse osteocalcin. The details of the assay are described in the kit manufactured 
by Biomedical Technologies Inc., 378 Page Street, Stoughton, MA 02072. Reagents for the assay are found as product 
numbers BT-431 (mouse osteocalcin standard), BT-432 (Goat anti-mouse Osteocalcin), BT-431R (iodinated mouse 
osteocalcin), BT-415 (normal goat serum) and BT-414 (donkey anti goat IgG). The RIA for osteocalcin synthesized by 
W-20 cells in response to BMP treatment is carried out as described in the protocol provided by the manufacturer. 
[0204] The values obtained for the test samples are compared to values for known standards of mouse osteocalcin 
and to the amount of osteocalcin produced by W-20 cells in response to challenge with known amounts of BMP-2. The 
values for BMP-2 induced osteocalcin synthesis by W-20 cells is shown in Table III. 



Table III 



Osteocalcin Synthesis by W-20 Cells 


BMP-2 Concentration ng/ml 


Osteocalcin Synthesis ng/well 


0 


0.8 


2 


0.9 


4 


0.8 


8 


2.2 


16 


2.7 


31 


3.2 


62 


5.1 


125 


6.5 


250 


8.2 


500 


9.4 


1000 


10.0 



EXAMPLE 9 - ROSEN MODIFIED SAMPATH-REDDI ASSAY 

[0205] A modified version of the rat bone formation assay described in Sampath and Reddi, Proc. Natl. Acad. Sci. 
USA, 80 :6591-6595 (1983) is used to evaluate bone and/or cartilage activity of BMP proteins. This modified assay is 
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herein called the Rosen-modified Sampath-Reddi assay. The ethanol precipitation step of the Sampath-Reddi proce- 
dure is replaced by dialyzing (if the composition is a solution) or diafiltering (if the composition is a suspension) the 
fraction to be assayed against water. The solution or suspension is then redissolved in 0.1% TFA, and the resulting 
solution added to 20 mg of rat matrix. A mock rat matrix sample not treated with the protein serves as a control. This 
5 material is frozen and lyophilized and the resulting powder enclosed in #5 gelatin capsules. The capsules are implanted 
subcutaneously in the abdominal thoracic area of 21-49 ay old male Long Evans rats. The implants are removed after 
7-14 days. Half of each implant is used for alkaline phosphatase analysis [see, A. H. Reddi et al, Proc. Natl. Acad. 
ScL, 69:1601 (1972)]. 

[0206] The other half of each implant is fixed and processed for histological analysis. 1 u/n glycolmethacryfate sec- 
w tions are stained with Von Kossa and acid fuschin to score the amount of induced bone and cartilage formation present 
in each implant. The terms +1 through +5 represent the area of each histological section of an implant occupied by 
new bone and/or cartilage cells and matrix. A score of +5 indicates that greater than 50% of the implant is new bone 
and/or cartilage produced as a direct result of protein in the implant. A score of +4, +3, +2, and +1 would indicate that 
greater than 40%, 30%, 20% and 1 0% respectively of the implant contains new cartilage and/or bone. 
15 [0207] The heterodimeric BMP proteins of this invention may be assessed for activity on this assay. 

[0208] Numerous modifications and variations in practice of this invention are expected to occur to those skilled in 
the art. Such modifications and variations are encompassed within the following claims. 

SEQUENCE LISTING 

20 

(1) GENERAL INFORMATION: 
[0209] 

25 (j) APPLICANT: Israel, David 

Wolfman, Neil M. 

(ii) TITLE OF INVENTION: Recombinant Bone Morphogenetic Protein Heterodimers, Compositions and Methods 
of Use. 

30 

(iii) NUMBER OF SEQUENCES: 30 

(iv) CORRESPONDENCE ADDRESS: . 

35 (A) ADDRESSEE: Legal Affairs, Genetics Institute, Inc. 

(B) STREET: 87 CambridgePark Drive 

(C) CITY: Cambridge 

(D) STATE: MA 

(E) COUNTRY: USA 
40 (F) ZIP: 02140-2387 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Tape 
45 (B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1 .0, Version #1 .25 

(vi) CURRENT APPLICATION DATA: 

50 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

55 (viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Kapinos, Ellen J. 

(B) REGISTRATION NUMBER: 32,245 
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(C) REFERENCE/DOCKET NUMBER: G1-5192B 
(ix) TELECOMMUNICATION INFORMATION: 

5 (A) TELEPHONE: 61 7-876-1 1 70 

(B) TELEFAX: 61 7-876-5851 

(2) INFORMATION FOR SEQ ID NO:1: 
10 [0210] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1607 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 

20 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 356. .1543 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 
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GTCGACTCTA GAGTGTGTGT CAGCACTTGG CTGGGGACTT CTTGAACTTG CAGGGAGAAT 60 

AACTTGCGCA CCCCACTTTG CGCCGGTGCC TTTGCCCCAG CGGAGCCTGC TTCGCCATCT 120 

5 CCGAGCCCCA CCGCCCCTCC ACTCCTCGGC CTTGCCCGAC ACTGAGACGC TGTTCCCAGC 180 

GTGAAAAGAG AGACTGCGCG GCCGGCACCC GGGAGAAGGA GGAGGCAAAG AAAAGGAACG 2 40 

GACATTCGGT CCTTGCGCCA GGTCCTTTGA CCAGAGTTTT TCCATGTGGA CGCTCTTTCA 3 00 

10 ATGGACGTGT CCCCGCGTGC TTCTTAGACG GACTGCGGTC TCCTAAAGGT CGACC ATG 3 58 

Met 
1 

GTG GCC GGG ACC CGC TGT CTT CTA GCG TTG CTG CTT CCC CAG GTC CTC 4 06 

val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gin Val Leu 
is 5 10 15 

CTG GGC GGC GCG GCT GGC CTC GTT CCG GAG CTG GGC CGC AGG AAG TTC 4 54 

Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys Phe 
20 25 30 

20 GCG GCG GCG TCG TCG GGC CGC CCC TCA TCC CAG CCC TCT GAC GAG GTC 502 

Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gin Pro Ser Asp Glu Val • 
35 40 45 

CTG AGC GAG TTC GAG TTG CGG CTG CTC AGC ATG TTC GGC CTG AAA CAG 55 0 

Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys Gin 
25 50 55 60 65 

AGA CCC ACC CCC AGC AGG GAC GCC GTG GTG CCC CCC TAC ATG CTA GAC 59 8 

Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu Asp 
70 75 80 

30 CTG TAT CGC AGG CAC TCA GGT CAG CCG GGC TCA CCC GCC CCA GAC CAC 64 6 

Leu Tyr Arg Arg His Ser Gly Gin Pro Gly ser Pro Ala Pro Asp His 
85 90 95 

CGG TTG GAG AGG GCA GCC AGC CGA GCC AAC ACT GTG CGC AGC TTC CAC 69 4 

Arg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Ph<* His 
35 100 105 110 

CAT GAA GAA TCT TTG GAA GAA CTA CCA GAA ACG AGT GGG AAA ACA ACC 7 42 

His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr Thr 
115 120 125 

40 CGG AGA TTC TTC TTT AAT TTA AGT TCT ATC CCC ACG GAG GAG TTT ATC 790 
Arg Arg Phe Phe Phe Asn Leu Ser Ser lie Pro Thr Glu Glu Phe He 
130 135 140 145 

ACC TCA GCA GAG CTT CAG GTT TTC CGA GAA CAG ATG CAA GAT GCT TTA 838 
Thr Ser Ala Glu Leu Gin Val Phe Arg Glu Gin Met Gin Asp Ala Leu 

45 150 155 160 

GGA AAC AAT AGC AGT TTC CAT CAC CGA ATT AAT ATT TAT GAA ATC ATA 88 6 

Gly Asn Asn Ser Ser Phe His His Arg He Asn He Tyr Glu He He 
165 170 175 

50 AAA CCT GCA ACA GCC AAC TCG AAA TTC CCC GTG ACC AGA CTT TTG GAC 93 4 

Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu Asp 



55 
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10 



180 185 190 

ACC AGG TTG GTG AAT CAG AAT GCA AGC AGG TGG GAA ACT TTT GAT GTC 9e2 
Thr Arg Leu Val Asn Gin Asn Ala Ser Arg Trp Glu Thr Phe Asp Val 
195 200 205 

ACC CCC GCT GTG ATG CGG TGG ACT GCA CAG GGA CAC GCC AAC CAT GGA 103 0 

Thr Pro Ala Val Met Arg Trp Thr Ala Gin Gly His Ala Asn His Gly 
210 215 220 225 

TTC GTG GTG GAA GTG GCC CAC TTG GAG GAG AAA CAA GGT GTC TCC AAG 1078 
Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gin Gly Val Ser Lys 
230 235 240 

AG A CAT GTT AGG ATA AGC AGG TCT TTG CAC CAA GAT GAA CAC AGC TGG 1126 
Arg His Val Arg lie Ser Arg Ser Leu His Gin Asp Glu His Ser Trp 
245 250 255 

TCA CAG ATA AGG CCA TTG CTA GTA ACT TTT GGC CAT GAT GGA AAA GGG 1174 
Ser Gin lie Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys Gly 
260 265 270 

CAT CCT CTC CAC AAA AG A GAA AAA CGT CAA GCC AAA CAC AAA CAG CGG 12 28 

His Pro Leu His Lys Arg Glu Lys Arg Gin Ala Lys His Lys Gin Arg 
275 280 285 

AAA CGC CTT AAG TCC AGC TGT AAG AGA CAC CCT TTG TAC GTG GAC TTC 12 70 

Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp Phe 
290 295 300 305 

AGT GAC GTG GGG TGG AAT GAC TGG ATT GTG GCT CCC CCG GGG TAT CAC 1318 
Ser Asp Val Gly Trp Asn Asp Trp lie Val Ala Pro Pro Gly Tyr His 
310 315 320 

GCC TTT TAC TGC CAC GGA GAA TGC CCT TTT CCT CTG GCT GAT CAT CTG 13 66 

Ala Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His Leu 
325 330 335 

AAC TCC ACT AAT CAT GCC ATT GTT CAG ACG TTG CTC AAC TCT GTT AAC 1414 
Asn Ser Thr Asn His Ala He Val Gin Thr Leu Val Asn Ser Val Asn 
340 345 350 

TCT AAG ATT CCT AAG GCA TGC TGT GTC CCG ACA GAA CTC AGT GCT ATC 14 66 

Ser Lys Xle Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala He 
355 360 365 

TCG ATG CTG TAC CTT GAC GAG AAT GAA AAG GTT GTA TTA AAG AAC TAT 1510 
Ser Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn Tyr 
40 370 375 3BO 385 

CAG GAC ATG GTT GTG GAG GGT TGT GGG TGT CGC TAGTACAGCA AAATTAAATA 1562 
Gin Asp Met Val Val Glu Gly Cys Gly Cys Arg 
390 395 

45 CAT AAAT AT A TAT AT AT AT A TAT A TTTT A G AAAAAAGAAA AAAA 1607 

(2) INFORMATION FOR SEQ ID NO:2: 
50 [0211] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 amino acids 
55 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 



Met Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gin Val 
15 10 15 

Leu Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys 
20 25 30 

Phe Ala Ala Ala ser Ser Gly Arg Pro Ser Ser Gin Pro Ser Asp Glu 

40 45 



35 



Val Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys 
50 55 60 

Gin Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu 

65 70 - 7 5 80 

Asp Leu Tyr Arg Arg Hip Ser Gly Gin Pro Gly Ser Pro Ala Pro Asp 

— 90 95 



85 



His Arg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Phe 
100 105 HO 

His His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr 
115 120 125 

Thr Arg Arg Phe Phe Phe Asn Leu Ser Ser lie Pro Thr Glu Glu Phe 
130 135 140 

lie Thr Ser Ala Glu Leu Gin Val Phe Arg Glu Gin Met Gin Asp Ala 
145 150 155 160 

Leu Gly Asn Asn Ser Ser Phe His His Arg lie Asn He Tyr Glu He 
165 170 175 

He Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu 
180 185 190 

Asp Thr Arg Leu Val Asn Gin Asn Ala Ser Arg Trp Glu Thr Phe Asp 
195 200 205 

• Val Thr Pro Ala Val Met Arg Trp Thr Ala Gin Gly His Ala Asn His 
210 215 220 

Gly Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gin Gly Val Ser 
225 230 235 240 

Lys Arg His Val Arg He Ser Arg Ser Leu His Gin Asp Glu His Ser 
250 255 



245 



Trp Ser Gin He Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys 
265 270 



260 



Gly His Pro Leu His Lys Arg Glu Lys Arg Gin Ala Lys His Lys Gin 
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275 280 285 

Arg Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp 
290 295 300 

Phe Ser Asp Val Gly Trp Asn Asp Trp He Val Ala Pro Pro Gly Tyr 
305 310 315 320 

His Ala Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His 
325 330 335 

Leu Asn Ser Thr Asn His Ala He Val Gin Thr Leu Val Asn Ser Val 
340 345 350 

Asn Ser Lys He Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala 
355 360 365 

He Ser Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn 
370 375 380 

Tyr Gin Asp Met Val Val Glu Gly Cys Gly Cys Arg 
385 390 395 

(2) INFORMATION FOR SEQ ID NO:3: 
[0212] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1954 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 403. .1626 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
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CTCTAGAGGG 


CAGAGGAGGA 


GGGAGGGAGG 


GAAGGAGCGC 


GG AG CCCGGC 


CCGGAAGCTA 


60 


GGTGAGTGTG 


GCATCCGAGC 


TGAGGGACGC 


GAGCCTGAGA 


CGCCGCTGCT 


GCTCCGGCTG 


120 


AGTATCTAGC 


TTGTCTCCCC 


GATGGGATTC 


CCGTCCAAGC 


TATCTCGAGC 


CTGCAGCGCC 


180 


ACAGTCCCCG 


GCCCTCGCCC 


AGGTTCACTG 


CAACCGTTCA 


GAGGTCCCCA 


GGAGCTGCTG 


240 


CTGGCGAGCC 


CGCTACTGCA 


GGGACCTATG 


GAGCCATTCC 


GTAGTGCCAT 


CCCGAGCAAC 


300 


GCACTGCTGC 


AGCTTCCCTG 


AGCCTTTCCA 


GCAAGTTTGT 


TCAAGATTGG 


CTGTCAAGAA 


3 6.0 


TCATGGACTG 


TTATTATATG 


CCTTGTTTTC 


TGTCAAGACA 


CC ATG ATT 
Met lie 


CCT GGT 
Pro Gly 


414 



1 

AAC CGA ATG CTG ATG GTC GTT TTA TTA TGC CAA GTC CTG CTA GGA GGC 4 62 

Asn Arg Met Leu Met Val Val Leu Leu Cys Gin Val Leu Leu Gly Gly 

5 10 15 20 



20 



25 



30 



35 



40 



45 



50 
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GCG AGC CAT GCT AGT TTG ATA CCT GAG ACG GGG AAG AAA AAA GTC GCC 
Ala ser His Ala Ser Leu lie Pro Glu Thr Gly Lys Lys Lys Val Ala 
25 30 35 

GAG ATT CAG GGC CAC GCG GGA GGA CGC CGC TCA GGG CAG AGC CAT GAG 
Glu lie Gin Gly His Ala Gly Gly Arg Arg Ser Gly Gin Ser His Glu 

40 45 50 

CTC CTG CGG GAC TTC GAG GCG ACA CTT CTG CAG ATG TTT GGG CTG CGC 
Leu Leu Arg Asp Phe Glu Ala Thr Leu Leu Gin Met Phe Gly Leu Arg 
55 60 65 

CGC CGC CCG CAG CCT AGC AAG AGT GCC GTC ATT CCG GAC TAC ATG CGG 
Arg Arg Pro Gin Pro Ser Lys Ser Ala Val lie Pro Asp Tyr Met Arg 
70 75 80 

GAT CTT TAC CGG CTT CAG TCT GGG GAG GAG GAG GAA GAG CAG ATC CAC 
Asp Leu Tyr Arg Leu Gin Ser Gly Glu Glu Glu Glu Glu Gin He His 
85 90 95 100 

AGC ACT GGT CTT GAG TAT CCT GAG CGC CCG GCC AGC CGG GCC AAC ACC 
Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser Arg Ala Asn Thr 
105 110 115 

GTG AGG AGC TTC CAC CAC GAA GAA CAT CTG GAG AAC ATC CCA GGG ACC 
Val Arg Ser Phe His His Glu Glu His Leu Glu Asn He Pro Gly Thr 
120 125 130 

AGT GAA AAC TCT GCT TTT CGT TTC CTC TTT AAC CTC AGC AGC ATC CCT 
Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu Ser Ser He Pro 
135 140 145 

GAG AAC GAG GTG ATC TCC TCT GCA GAG CTT CGG CTC TTC CGG GAG CAG 
Glu Asn Glu Val He Ser Ser Ala Glu Leu Arg Leu Phe Arg Glu Gin 
150 155 160 

GTG GAC CAG GGC CCT GAT TGG GAA AGG GGC TTC CAC CGT ATA AAC ATT 

Val Asp Gin Gly Pro Asp Trp Glu Arg Gly Phe His Arg He Asn He 
165 170 175 ' 180 

TAT GAG GTT ATG AAG CCC CCA GCA GAA GTG GTG CCT GGG CAC CTC ATC 
Tyr Glu Val Met Lys Pro Pro Ala Glu Val Val Pro Gly His Leu He 
185 190 195 

ACA CGA CTA CTG GAC ACG AG A CTG GTC CAC CAC AAT GTG ACA CGG TGG 
Thr Arg Leu Leu Asp Thr Arg Leu Val His His Asn Val Thr Arg Trp 
200 205 210 

GAA ACT TTT GAT GTG AGC CCT GCG GTC CTT CGC TGG ACC CGG GAG AAG 
Glu Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp Thr Arg Glu Lys 
215 220 225 

CAG CCA AAC TAT GGG CTA GCC ATT GAG GTG ACT CAC CTC CAT CAG ACT 
Gin Pro Asn Tyr Gly Leu Ala lie Glu Val Thr His Leu His Gin Thr 
230 235 240 

CGG ACC CAC CAG GGC CAG CAT GTC AGG ATT AGC CGA TCG TTA CCT CAA 
Arg Thr His Gin Gly Gin His Val Arg lie Ser Arg Ser Leu Pro Gin 
245 250 255 260 
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GGG AGT GGG AAT TGG GCC CAG CTC CGG CCC CTC CTG GTC ACC TTT GGC 12 3 0 

Gly Ser Gly Asn Trp Ala Gin Leu Arg Pro Leu Leu Val Thr Phe Gly 
265 270 275 

CAT GAT GGC CGG GGC CAT GCC TTG ACC CGA CGC CGG AGG GCC AAG CGT 12 7 B 

His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg Arg Ala Lys Arg 
2B0 285 290 

AGC CCT AAG CAT CAC TCA CAG CGG GCC AGG AAG AAG AAT AAG AAC TGC 13 2 6 

Ser Pro Lys His His Ser Gin Arg Ala Arg Lys Lys Asn Lys Asn Cys 

295 300 305 

CGG CGC CAC TCG CTC TAT GTG GAC TTC AGC GAT GTG GGC TGG AAT GAC 13 7 4 

Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp 
310 315 320 

TGG ATT GTG GCC CCA CCA GGC TAG CAG GCC TTC TAC TGC CAT GGG GAC 14 2 2 

Trp He Val Ala Pro Pro Gly Tyr Gin Ala Phe Tyr Cys His Gly Asp 
325 330 335 340 

TGC CCC TTT CCA CTG GCT GAC CAC CTC AAC TCA ACC AAC CAT GCC ATT 14 70 

Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala lie 

345 350 355 

GTG CAG ACC CTG GTC AAT TCT GTC AAT TCC AGT ATC CCC AAA GCC TGT 1518 
val Gin Thr Leu Val Asn Ser Val Asn Ser ser lie Pro Lys Ala Cys 
360 365 370 

TGT GTG CCC ACT GAA CTG AGT GCC ATC TCC ATG CTG TAC CTG GAT GAG 1566 
Cys Val Pro Thr Glu Leu Ser Ala He ser Met Leu Tyr Leu Asp Glu 
375 380 385 

TAT GAT AAG GTG GTA CTG AAA AAT TAT CAG GAG ATG GTA GTA GAG GGA 1614 
Tyr Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met Val Val Glu Gly 
390 395 400 

TGT GGG TGC CGC TGAGATCAGG CAGTCCTTGA GGATAGACAG ATATACACAC 1666 
Cys Gly Cys Arg 
405 

CACACACACA CACCACATAC ACCACACACA CACGTTCCCA TCCACTCACC CACACACTAC 172 6 

35 ACAGACTGCT TCCTTATAGC TGGACTTTTA TTTAAAAAAA AAAAAAAAAA AATGGAAAAA 17 86 

ATC CCT AAA C ATTCACCTTG ACCTTATTTA TGACTTTACG TGCAAATGTT TTGACCATAT 184 6 

TG ATC AT AT A TTTTGACAAA ATATATTTAT AACTACGTAT TAAAAGAAAA AAATAAAATG 19 06 

40 AGT C ATT ATT TTAAAAAAAA AAAAAAAACT CTAGAGTCGA CGGAATTC 1954 

(2) INFORMATION FOR SEQ ID NO:4: 
45 [0213] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 408 amino acids 
so (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 



20 
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Met He Pro Gly Asn Arg Met Leu Met Val Val Leu Leu Cys Gin Val 
15 10 15 

, Leu Leu Gly Gly Ala Ser His Ala Ser Leu He Pro Glu Thr Gly Lys 

20 25 30 

Lys Lys Val Ala Glu He Gin Gly His Ala Gly Gly Arg Arg Ser Gly 
35 40 45 

10 Gin Ser His Glu Leu Leu Arg Asp Phe Glu Ala Thr Leu Leu Gin Met 

50 55 60 

Phe Gly Leu Arg Arg Arg Pro Gin Pro Ser Lys Ser Ala Val lie Pro 
65 70 75 80 



15 



20 



35 



50 



Asp Tyr Met Arg Asp Leu Tyr Arg Leu Gin Ser Gly Glu Glu Glu Glu 

85 90 95 

Glu Gin He His Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser 
100 105 110 

V 

Arg Ala Asn Thr Val Arg Ser Phe His His Glu Glu His Leu Glu Asn 
115 120 125 

He Pro Gly Thr Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu 
130 135 140 

Ser Ser He Pro Glu Asn Glu Val He ser Ser Ala Glu Leu Arg Leu 
145 150* 155 160 

Phe Arg Glu Gin Val Asp Gin Gly Pro Asp Trp Glu Arg Gly Phe His 
165 170 175 

Arg He Asn He Tyr Glu Val Met Lys Pro Pro Ala Glu Val Val Pro 
180 185 190 

Gly His Leu He Thr Arg Leu Leu Asp Thr Arg Leu Val His His Asn 
195 200 205 

Val Thr Arg Trp Glu Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp 
210 215 220 

Thr Arg Glu Lys Gin Pro Asn Tyr Gly Leu Ala He Glu Val Thr His 
225 230 235 240 

Leu His Gin Thr Arg Thr His Gin Gly Gin His Val Arg He Ser Arg 
245 250 255 

Ser Leu Pro Gin Gly Ser Gly Asn Trp Ala Gin Leu Arg Pro Leu Leu 
260 265 270 

Val Thr Phe Gly His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg 
275 280 285 

Arg Ala Lys Arg Ser Pro Lys His His Ser Gin Arg Ala Arg Lys Lys 
290 295 300 
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Asn Lys Asn Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val 
305 310 315 320 

Gly Trp Asn Asp Trp lie Val Ala Pro Pro Gly Tyr Gin Ala Phe Tyr 
325 330 335 

Cys His Gly Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr 
340 345 350 

Asn His Ala lie Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser lie 
355 360 365 

Pro Lys Ala cys cys Val Pro Thr Glu Leu Ser Ala He Ser Met Leu 
370 375 380 

Tyr Leu Asp Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met 
385 390 395 400 

Val Val Glu Gly Cys Gly cys Arg 
405 



(2) INFORMATION FOR SEQ ID NO:5: 
[0214] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 
(tx) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 97.. 1389 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
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GTGACCGAGC GGCGCGGACG GCCGCCTGCC CCCTCTGCCA CCTGGGGCGG TGCGGGCCCG 

GAGCCCGGAG CCCGGGTAGC GCGTAGAGCC GGCGCG ATG CAC GTG CGC TCA CTG 

Met His val Arg Ser Leu 
1 5 

CGA GCT GCG GCG CCG CAC AGC TTC GTG GCG CTC TGG GCA CCC CTG TTC 
Arg Ala Ala Ala Pro His Ser Phe Val Ala Leu Trp Ala Pro Leu Phe 
10 15 20 

CTG CTG CGC TCC GCC CTG GCC GAC TTC AGC CTG GAC AAC GAG GTG CAC 
Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser Leu Asp Asn Glu Val His 
25 30 35 

TCG AGC TTC ATC CAC CGG CGC CTC CGC AGC CAG GAG CGG CGG GAG ATG 
Ser Ser Phe lie His Arg Arg Leu Arg Ser Gin Glu Arg Arg Glu Met 
40 45 50 

CAG CGC GAG ATC CTC TCC ATT TTG GGC TTG CCC CAC CGC CCG CGC CCG 
Gin Arg Glu lie Leu Ser He Leu Gly Leu Pro His Arg Pro Arg Pro 
55 60 65 70 
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CAC CTC CAG GGC AAG CAC AAC TCG GCA CCC ATG TTC ATG CTG GAC CTG 3 54 

His Leu Gin Gly Lys His Asn Ser Ala Pro Met Phe Met Leu Asp Leu 
75 80 85 

TAC AAC GCC ATG GCG GTG GAG GAG GGC GGC GGG CCC GGC GGC CAG GGC 4 02 

Tyr Asn Ala Met Ala Val Glu Glu Gly Gly Gly Pro Gly Gly Gin Gly 
90 95 100 

TTC TCC TAC CCC TAC AAG GCC GTC TTC AGT ACC CAG GGC CCC CCT CTG 4 5 0 

Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Thr Gin Gly Pro Pro Leu 
105 110 115 

GCC AGC CTG CAA GAT AGC CAT TTC CTC ACC GAC GCC GAC ATG GTC ATG 4 98 

Ala Ser Leu Gin Asp Ser His Phe Leu Thr Asp Ala Asp Met Val Met 
120 125 130 

AGC TTC GTC AAC CTC GTG GAA CAT GAC AAG GAA TTC TTC CAC CCA CGC 54 6 

Ser Phe val Asn Leu Val Glu His Asp Lys Glu Phe Phe His Pro Arg 
135 140 145 150 

TAC CAC CAT CGA GAG TTC CGG TTT GAT CTT TCC AAG ATC CCA GAA GGG 594 
Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser Lys He Pro Glu Gly 
155 160 165 

GAA GCT GTC ACG GCA GCC GAA TTC CGG ATC TAC AAG GAC TAC ATC CGG 64 2 

Glu Ala Val Thr Ala Ala Glu Phe Arg He Tyr Lys Asp Tyr He Arg 
170 175 180 

GAA CGC TTC GAC AAT GAG ACG TTC CGG ATC AGC GTT TAT CAG GTG CTC 690 
Glu Arg Phe Asp Asn Glu Thr Phe Arg He Ser Val Tyr Gin Val Leu 
185 190 195 

CAG GAG CAC TTG GGC AGG GAA TCG GAT CTC TTC CTG CTC GAC AGC CGT 7 38 

Gin Glu His Leu Gly Arg Glu Ser Asp Leu Phe Leu Leu Asp Ser Arg 
200 205 210 

30 ACC CTC TGG GCC TCG GAG GAG GGC TGG CTG GTG TTT GAC ATC ACA GCC 786 

Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu Val Phe Asp He Thr Ala 
215 220 225 230 



20 



25 



35 



40 



45 



50 



55 



ACC AGC AAC CAC TGG GTG GTC AAT CCG CGG CAC AAC CTG GGC CTG CAG 8 34 

Thr Ser Asn His Trp Val Val Asn Pro Arg His Asn Leu Gly Leu Gin 
235 240 245 

CTC TCG GTG GAG ACG CTG GAT GGG CAG AGC ATC AAC CCC AAG TTG GCG 88 2 

Leu Ser Val Glu Thr Leu Asp Gly Gin Ser He Asn Pro Lys Leu Ala 
250 255 260 

GGC CTG ATT GGG CGG CAC GGG CCC CAG AAC AAG CAG CCC TTC ATG GTG 930 

Gly Leu He Gly Arg His Gly Pro Gin Asn Lys Gin Pro Phe Met Val 
265 270 275 

GCT TTC TTC AAG GCC ACG GAG GTC CAC TTC CGC AGC ATC CGG TCC ACG 978 
Ala Phe Phe Lys Ala Thr Glu Val His Phe Arg Ser He Arg Ser Thr 
280 285 290 

GGG AGC AAA CAG CGC AGC CAG AAC CGC TCC AAG ACG CCC AAG AAC CAG 102 6 

Gly Ser Lys Gin Arg ser Gin Asn Arg Ser Lys Thr Pro Lys Asn Gin 

295 300 305 3 lO 
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GAA GCC CTG CGG ATG GCC AAC GTG GCA GAG AAC AGC AGC AGC GAC CAG 1074 
Glu Ala Leu Arg Met Ala Asn Val Ala Glu Asn Ser Ser Ser Asp Gin 
315 320 325 

AGG CAG GCC TGT AAG AAG CAC GAG CTG TAT GTC AGC TTC CGA GAC CTG 1122 
Arg Gin Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu 
330 335 340 

GGC TGG CAG GAC TGG ATC ATC GCG CCT GAA GGC TAC GCC GCC TAC TAC 117 0 

Gly Trp Gin Asp Trp lie lie Ala Pro Glu Gly Tyr Ala Ala Tyr Tyr 
345 350 355 

TGT GAG GGG GAG TGT GCC TTC CCT CTG AAC TCC TAC ATG AAC GCC ACC 1218 
Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn Ala Thr 
360 365 370 

AAC CAC GCC ATC GTG CAG ACG CTG GTC CAC TTC ATC AAC CCG GAA ACG ' 12 66 

Asn His Ala lie Val Gin Thr Leu Val His Phe lie Asn Pro Glu Thr 
375 380 385 390 

GTG CCC AAG CCC TGC TGT GCG CCC ACG CAG CTC AAT GCC ATC TCC GTC 13 14 

Val Pro Lys Pro cys Cys Ala Pro Thr Gin Leu Asn Ala He Ser val 
395 400 405 

CTC TAC TTC GAT GAC AGC TCC AAC GTC ATC CTG AAG AAA TAC AGA AAC 13 62 

Leu Tyr Phe Asp Asp Ser Ser Asn Val He Leu Lys Lys Tyr Arg Asn 
410 415 420 

ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCCTCC GAGAATTCAG 14 09 

Met Val Val Arg Ala Cys Gly Cys His 
425 430 

ACCCTTTGGG GCCAAGTTTT TCTGGATCCT CCATTGCTC 14 4 8 

(2) INFORMATION FOR SEQ ID NO:6: 
[0215] 

(i) SEQUENCE CHARACTERISTICS: 

35 (A) LENGTH: 431 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



20 



30 



40 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 



Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 

45 1 5 10 15 

Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser 

20 25 30 

Leu Asp Asn Glu Val His Ser Ser Phe lie His Arg Arg Leu Arg Ser 

50 3 5 4 0 4 5 

Gin Glu Arg Arg Glu Met Gin Arg Glu He Leu Ser Xle Leu Gly Leu 

50 55 60 

Pro His Arg Pro Arg Pro His Leu Gin Gly Lys His Asn Ser Ala Pro 
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65 70 75 80 

Met Phe Met Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Gly Gly 
85 90 95 

Gly Pro Gly Gly Gin Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser 
100 105 110 

Thr Gin Gly Pro Pro Leu Ala Ser Leu Gin Asp Ser His Phe Leu Thr 
115 120 125 

Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys 
130 135 140 

Glu Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu 
145 150 155 160 

Ser Lys lie Pro Glu Gly Glu Ala Val Thr Ala Ala Glu Phe Arg lie 
165 170 175 

Tyr Lys Asp Tyr lie Arg Glu Arg Phe Asp Asn Glu Thr Phe Arg He 
180 185 190 

Ser Val Tyr Gin Val Leu Gin Glu His Leu Gly Arg Glu Ser Asp Leu 
195 ' 200 205 

Phe Leu Leu Asp Ser Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu 
210 215 220 

Val Phe Asp He Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg 
225 230 235 240 

His Asn Leu Gly Leu Gin Leu Ser Val Glu Thr Leu Asp Gly Gin Ser 
245 250 255 

He Asn Pro Lys Leu Ala Gly Leu lie Gly Arg His Gly Pro Gin Asn 
260 265 270 

35 Lys Gin Pro Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe 

275 280 285 

Arg Ser He Arg Ser Thr Gly Ser Lys Gin Arg Ser Gin Asn Arg Ser 
290 295 300 

40 Lys Thr Pro Lys Asn Gin Glu Ala Leu Arg Met Ala Asn Val Ala Glu 

305 310 315 320 

Asn Ser Ser Ser Asp Gin Arg Gin Ala Cys Lys Lys His Glu Leu Tyr 
325 330 335 

45 Val Ser Phe Arg Asp Leu Gly Trp Gin Asp Trp He He Ala Pro Glu 

340 345 350 

Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn 
355 360 365 



25 



30 



50 



Ser Tyr Met Asn Ala Thr Asn His Ala He Val Gin Thr Leu Val His 
370 375 380 

Phe He Asn Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gin 
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385 390 395 400 

Leu Asn Ala He Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val He 
405 410 415 

5 

Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
420 425 430 

(2) INFORMATION FOR SEQ ID NO:7: 

10 

[0216] 

(i) SEQUENCE CHARACTERISTICS: 

15 (A) LENGTH: 2923 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

20 (ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

25 

(A) ORGANISM: Homo sapiens 
(F) TISSUE TYPE: Human placenta 

(vii) IMMEDIATE SOURCE: 

30 

(A) LIBRARY: Stratagene catalog #936203 Human placenta cDNA library 

(B) CLONE: BMP6C35 

(viii) POSITION IN GENOME: 

35 

(C) UNITS: bp 

(ix) FEATURE: 

40 (A) NAME/KEY: CDS 

(B) LOCATION: 160.. 1701 

(ix) FEATURE: 

45 (A) NAME/KEY: mat_peptide 

(B) LOCATION: 1282.. 1698 

(ix) FEATURE: 

50 (A) NAME/KEY: mRNA . 

(B) LOCATION: 1..2923 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

55 
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10 



25 



CGACCATGAG AGATAAGGAC TGAGGGCCAG GAAGGGGAAG CGAGCCCGCC GAGAGGTGGC SO 
GGGGACTGCT CACGCCAAGG GCCACAGCGG CCGCGCTCCG GCCTCGCTCC GCCGCTCCAC 12 0 

GCCTCGCGGG ATCCGCGGGG GCAGCCCGGC CGGGCGGGG ATG CCG GGG CTG GGG 17 4 

Met Pro Gly Leu Gly 

-374 -370 

CGG AGG GCG CAG TGG CTG TGC TGG TGG TGG GGG CTG CTG TGC AGC TGC 222 
Arg Arg Ala Gin Trp Leu Cys Trp Trp Trp Gly Leu Leu Cys Ser cys 
-365 -360 -355 

TGC GGG CCC CCG CCG CTG CGG CCG CCC TTG CCC GCT GCC GCG GCC GCC 2 70 



35 



42 
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Cys 


Gly 


Pro 


Pro Pro 
-350 


Leu 


Arg 


Pro 


Pro Leu 
-345 


Pro 


Ala 


Ala 


Ala Ala 
-340 


Ala 




5 


GCC 
Ala 


GCC 
Ala 


GGG GGG 
Gly Gly 
-335 


CAG 

Gin 


CTG 
Leu 


CTG 
Leu 


GGG GAC 
Gly Asp 
-330 


GGC 
Gly 


GGG 

Gly 


AGC 
Ser 


CCC 
Pro 

-32! 


GGC 
Gly 

5 


CGC 

Arg 


ACG 

Thr 


318 




GAG 
Glu 


CAG CCG 
Gin Pro 
-320 


CCG 
Pro 


CCG 
Pro 


TCG 
Ser 


CCG CAG 
Pro Gin 
-315 


TCC 
Ser 


TCC 
Ser 


TCG 
Ser 


GGC TTC 
Gly Phe 
-310 


CTG 
Leu 


TAC 

Tyr 


CGG 
Arg 


366 


10 


CGG CTC 
Arg Leu 
-305 


AAG 
Lys 


ACG 
Thr 


CAG 
Gin 


GAG AAG 
Glu Lys 
-300 


CGG 
Arg 


GAG 
Glu 


ATG 
Met 


CAG AAG 
Gin Lys 
-295 


GAG 
GlU 


ATC 
He 


TTG 
Leu 


TCG 
Ser 
-290 


4 14 


15 


GTG 

val 


CTG 
Leu 


GGG 
Gly 


CTC 
Leu 


CCG CAC 
Pro His 
-285 


CGG 
Arg 


CCC 

Pro 


CGG 

Arg 


CCC CTG 
Pro Leu 
-280 


CAC 
His 


GGC 

Gly 


CTC 
Leu 


CAA CAG 
Gin Gin 

-275 


462 




CCG 
Pro 


CAG 
Gin 


CCC 
Pro 


CCG GCG 
Pro Ala 
-270 


CTC 
Leu 


CGG 
Arg 


CAG 
Gin 


CAG GAG 
Gin Glu 
-265 


GAG 
GlU 


CAG 
Gin 


CAG 
Gin 


CAG CAG 
Gin Gin 
-260 


CAG 
Gin 


510 


20 


CAG 
Gin 


CTG 
Leu 


CCT CGC 
Pro Arg 
-255 


GGA 
Gly 


GAG 
Glu 


CCC 
Pro 


CCT CCC 
Pro Pro 
-250 


GGG 

Gly 


CGA 
Arg 


CTG 
Leu 


AAG TCC 
Lys Ser 
-245 


GCG 
Ala 


CCC 

Pro 


558 


25 


CTC 
Leu 


TTC ATG 
Phe Met 
-240 


CTG 
Leu 


GAT 
Asp 


CTG 
Leu 


TAC AAC 
Tyr Asn 
-235 


GCC 
Ala 


CTG 
Leu 


TCC 
Ser 


GCC GAC 
Ala Asp 
-230 


AAC 
Asn 


GAC 
Asp 


GAG 
GlU 


606 




GAC GGG 
Asp Gly 
-225 


GCG 
Ala 


TCG 
Ser 


GAG 
Glu 


GGG GAG 
Gly Glu 
-220 


AGG 
Arg 


CAG 
Gin 


CAG 
Gin 


TCC TGG 
Ser Trp 
-215 


CCC 
Pro 


CAC 
His 


GAA 
GlU 


GCA 

Ala 
-210 


654 


30 


GCC 
Ala 


AGC 
ser 


TCG 
Ser 


TCC 
Ser 


CAG CGT 
Gin Arg 
-205 


CGG 
Arg 


CAG 
Gin 


CCG 
Pro 


CCC CCG 
Pro Pro 
-200 


GGC 
Gly 


GCC 
Ala 


GCG 
Ala 


CAC CCG 
His Pro 
-195 


702 


35 


CTC 
Leu 


AAC 
Asn 


CGC 
Arg 


AAG AGC 
Lys Ser 
-190 


CTT 
Leu 


CTG 
Leu 


GCC 
Ala 


CCC GGA 

Pro Gly 
-185 


TCT 
Ser 


GGC 
Gly 


AGC 
Ser 


GGC GGC 
Gly Gly 
-180 


GCG 
Ala 


750 


TCC 
Ser 


CCA 

Pro 


CTG ACC 
Leu Thr 
-175 


AGC 

Ser 


GCG 
Ala 


CAG 
Gin 


GAC AGC 
Asp Ser 

-170 


GCC 
Ala 


TTC 
Phe 


CTC 
Leu 


AAC GAC 
Asn Asp 
-165 


GCG 
Ala 


GAC 
Asp 


798 


40 


ATG 
Met 


GTC ATG 
Val Met 
-160 


AGC 
Ser 


TTT 
Phe 


GTG 
Val 


AAC CTG 
Asn Leu 
-155 


GTG 
Val 


GAG 
Glu 


TAC 
Tyr 


GAC AAG 
Asp Lys 
-150 


GAG 
Glu 


TTC 
Phe 


TCC 
Ser 


846 




CCT CGT 
Pro Arg 
-145 


CAG 
Gin 


CGA 
Arg 


CAC 
His 


CAC AAA 

His Lys 
-140 


GAG 
Glu 


TTC 
Phe 


AAG 

Lys 


TTC 
Phe 
-135 


AAC 
Asn 


TTA 

Leu 


TCC 
Ser 


CAG 

Gin 


ATT 

He 
-130 


894 


45 


CCT 
Pro 


GAG 
Glu 


GGT 
Gly 


GAG 
Glu 


GTG GTG 

Val val 

-125 


ACG 
Thr 


GCT 
Ala 


GCA 
Ala 


GAA 
Glu 
-120 


TTC 
Phe 


CGC 
Arg 


ATC 
lie 


TAC 
Tyr 


AAG 

Lys 

-115 


GAC 
Asp 


942 




TGT 


GTT 


ATG 


GGG 


AGT 


TTT 


AAA 


AAC 


CAA 


ACT 


TTT 


CTT 


ATC 


AGC 


ATT 


TAT 


990 



50 



55 
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10 



20 



25 



35 



Cys val Met Gly Ser Phe Lys Asn Gin Thr Phe Leu He ser He Tyr 
-110 -105 -100 

CAA GTC TTA CAG GAG CAT CAG CAC AGA GAC TCT GAC CTG TTT TTG TTG l038 
Gin Val Leu Gin Glu His Gin His Arg Asp Ser Asp Leu Phe Leu Leu 
-95 -90 "85 

GAC ACC CGT GTA GTA TGG GCC TCA GAA GAA GGC TGG CTG GAA TTT GAC 108 6 

Asp Thr Arg Val val Trp Ala Ser Glu Glu Gly Trp Leu Glu Phe Asp 
-80 -75 -70 

ATC ACG GCC ACT AGC AAT CTG TGG GTT GTG ACT CCA CAG CAT AAC ATG 11^4 
lie Thr Ala Thr Ser Asn Leu Trp Val Val Thr Pro Gin His Asn Met 
-65 -60 -55 -50 

GGG CTT CAG CTG AGC GTG GTG ACA AGG GAT GGA GTC CAC GTC CAC CCC 1182 
Gly Leu Gin Leu Ser Val Val Thr Arg Asp Gly Val His Val His Pro 
-45 -40 -35 

CGA GCC GCA GGC CTG GTG GGC AGA GAC GGC CCT TAG GAT AAG CAG CCC 12 3 0 

Arg Ala Ala Gly Leu Val Gly Arg Asp Gly Pro Tyr Asp Lys Gin Pro 
-30 -25 -20 

TTC ATG GTG GCT TTC TTC AAA GTG AGT GAG GTC CAC GTG CGC ACC ACC 12 7 8 

Phe Met Val Ala Phe pfte Lys val Ser Glu Val His Val Arg Thr Thr 
-15 -10 -5 

AGG TCA GCC TCC AGC CGG CGC CGA CAA CAG AGT CGT AAT CGC TCT ACC 13 2 6 

Arg Ser Ala Ser Ser Arg Arg Arg Gin Gin Ser Arg Asn Arg Ser Thr 
1 5 10 15 

CAG TCC CAG GAC GTG GCG CGG GTC TCC AGT GCT TCA GAT TAC AAC AGC 1374 
Gin Ser Gin Asp Val Ala Arg Val Ser Ser Ala Ser Asp Tyr Asn Ser 
20 25 30 

AGT GAA TTG AAA ACA GCC TGC AGG AAG CAT GAG CTG TAT GTG AGT TTC 14 22 

Ser Glu Leu Lys Thr Ala Cys Arg Lys His Glu Leu Tyr val Ser Phe 
35 40 45 

CAA GAC CTG GGA TGG CAG GAC TGG ATC ATT GCA CCC AAG GGC TAT GCT 14 70 

Gin Asp Leu Gly Trp Gin Asp Trp lie He Ala Pro Lys Gly Tyr Ala 
50 55 60 

GCC AAT TAC TGT GAT GGA GAA TGC TCC TTC CCA CTC AAC GCA CAC ATG 15 18 

Ala Asn Tyr Cys Asp Gly Glu Cys Ser Phe Pro Leu Asn Ala His Met 
65 70 75 

AAT GCA ACC AAC CAC GCG ATT GTG CAG ACC TTG GTT CAC CTT ATG AAC 1566 
Asn Ala Thr Asn His Ala He Val Gin Thr Leu Val His Leu Met Asn 
80 85 90 95 

CCC GAG TAT GTC CCC AAA CCG TGC TGT GCG CCA ACT AAG CTA AAT GCC 1614 
Pro Glu Tyr Val Pro Lys Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala 
100 105 110 

ATC TCG GTT CTT TAC TTT GAT GAC AAC TCC AAT GTC ATT CTG AAA AAA 16 62 

He Ser Val Leu Tyr Phe Asp Asp Asn ser Asn Val He Leu Lys Lys 
115 120 125 

TAC AGG AAT ATG GTT GTA AGA GCT TGT GGA TGC CAC TAACT CG AAA 17 08 
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Tyr Arg Asn Met Val Val Arg Ala Cys Gly cys His 

130 135 140 



5 



w 



15 



20 



25 



35 



CCAGATGCTG 


GGGACACACA 


TTCTGCCTTG 


GATTCCTAGA 


TTACATCTGC 


CTTAAAAAAA 


1768 


CACGGAAGCA 


CAGTTGGAGG 


TGGGACGATG 


AGACTTTGAA 


ACTATCTCAT 


GCCAGTGCCT 


1826 


TATTACCCAG 


GAAGATTTTA 


AAGGACCTCA 


TTAATAATTT 


GCTCACTTGG 


TAAATGACGT 


1S88 


GAGTAGTTGT 


TGGTCTGTAG 


CAAGCTGAGT 


TTGGATGTCT 


GT AG CATAAG 


GTCTGGTAAC 


1948 


TGCAGAAACA 


TAACCGTGAA 


GCTCTTCCTA 


CCCTCCTCCC 


CCAAAAACCC 


ACCAAAATTA 


2008 


GTTTTAGCTG 


TAGATCAAGC 


TATTTGGGGT 


GTTTGTTAGT 


AAATAGGGAA 


AATAATCTCA 


2068 


AAGGAGTTAA 


ATGTATTCTT 


GGCTAAAGGA 


TCAGCTGGTT 


CAGTACTGTC 


TATCAAAGGT 


2128 


AGATTTTACA 


GAGAACAGAA 


ATCGGGGAAG 


TGGGGGGAAC 


GCCTCTGTTC 


AGTTCATTCC 


2188 


CAGAAGTCCA 


CAGGACGCAC 


AGCCCAGGCC 


ACAGCCAGGG 


CTCCACGGGG 


CGCCCTTGTC 


22 4 8 


TCAGTCATTG 


CTGTTGTATG 


TTCGTGCTGG 


AGTTTTGTTG 


GTGTGAAAAT 


ACACTTATTT 


2308 


CAGCCAAAAC 


ATACCATTTC 


TACACCTCAA 


TCCTCCATTT 


GCTGTACTCT 


TTGCTAGTAC 


2368 


CAAAAGTAGA 


CTGATTACAC 


TGAGGTGAGG 


CTACAAGGGG 


TGTGTAACCG 


TGTAACACGT 


2428 


GAAGGCAGTG 


CTCACCTCTT 


CTTTACCAGA 


ACGGTTCTTT 


GACCAGCACA 


TTAACTTCTG 


2488 


GACTGCCGGC 


TCTAGTACCT 


TTTCAGTAAA 


GTGGTTCTCT 


GCCTTTTTAC 


TATACAGCAT 


2548 


ACCACGCCAC 


AGGGTTAGAA 


CCAACGAAGA 


AAATAAAATG 


AGGGTGCCCA 


GCTTATAAGA 


2600 


ATGGTGTTAG 


GGGGATGAGC 


ATGCTGTTTA 


TG AA CG G AAA 


TCATGATTTC 


C CTG T AG AAA 


2668 


GTGAGGCTCA 


GATTAAATTT 


TAG AA TATTT 


TCTAAATGTC 


TTTTTCACAA 


TCATGTGACT 


2728 


GGGAAGGCAA 


TTT CAT ACT A 


AACTGATTAA 


ATAATACATT 


TATAATCTAC 


AACTGTTTGC 


2788 


ACTTACAGCT 


TTTTTTGTAA 


ATATAAACTA 


TAATTTATTG 


TCTATTTTAT 


AfCTGTTTTG 


2848 


CTGTGGCGTT 


GGGCGGGGGG 


CCGGG CTTTT 


GGGGGGGGGG 


GTTTGTTTGG 


GGGGTGTCGT 


2908 


GGTGTGGGCG 


GGCGG 










2923 



(2) INFORMATION FOR SEQ ID NO:8: 

40 

[0217] 

(i) SEQUENCE CHARACTERISTICS: 

45 (A) LENGTH: 513 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

50 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 



Met Pro Gly Leu Gly Arg Arg Ala Gin Trp Leu Cys Trp Trp Trp Gly 
55 ~374 -370 -365 -360 
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10 



Leu Leu Cys Ser Cys Cys Gly Pro Pro Pro Leu Arg Pro Pro Leu Pro 
-355 -350 -345 

Ala Ala Ala Ala Ala Ala Ala Gly Gly Gin Leu Leu Gly Asp Gly Gly 
-340 -335 -330 

Ser Pro Gly Arg Thr Glu Gin Pro Pro Pro Ser Pro Gin Ser Ser Ser 
-325 -320 -315 

Gly Phe Leu Tyr Arg Arg Leu Lys Thr Gin Glu Lys Arg Glu Met Gin 
-310 -305 -300 -295 

Lys Glu lie Leu Ser Val Leu Gly Leu Pro His Arg Pro Arg Pro Leu 
-290 -285 -280 

15 His Gly Leu Gin Gin Pro Gin Pro Pro Ala Leu Arg Gin Gin Glu Glu 

-275 -270 -265 

Gin Gin Gin Gin Gin Gin Leu Pro Arg Gly Glu Pro Pro Pro Gly Arg 
-260 -255 -250 

20 Leu Lys Ser Ala Pro Leu Phe Met Leu Asp Leu Tyr Asn Ala Leu Ser 

-245 " -240 -235 

Ala Asp Asn Asp Glu As^p Gly Ala Ser Glu Gly Glu Arg Gin Gin Ser 
-230 -225 -220 -215 

25 Trp Pro His Glu Ala Ala Ser Ser Ser Gin Arg Arg Gin Pro Pro Pro 

-210 -205 -200 

Gly Ala Ala His Pro Leu Asn Arg Lys Ser Leu Leu Ala Pro Gly Ser 
-195 -190 -105 

30 Gly Ser Gly Gly Ala Ser Pro Leu Thr Ser Ala Gin Asp Ser Ala Phe 

-180 -175 -170 

Leu Asn Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu Tyr 
-165 -160 -155 

35 Asp Lys Glu Phe Ser Pro Arg Gin Arg His His Lys Glu Phe Lys Phe 

-150 -145 -140 -135 

Asn Leu ser Gin lie Pro Glu Gly Glu Val val Thr Ala Ala Glu Phe 
-130 -125 -120 

40 Arg lie Tyr Lys Asp Cys val Met Gly Ser Phe Lys Asn Gin Thr Phe 

-115 -110 -105 

Leu lie Ser He Tyr Gin Val Leu Gin Glu His Gin His Arg Asp Ser 
-100 -95 -90 

45 Asp Leu Phe Leu Leu Asp Thr Arg Val val Trp Ala Ser Glu Glu Gly 

-85 -80 -75 

Trp Leu Glu Phe Asp lie Thr Ala Thr Ser* Asn Leu Trp Val Val Thr 
-70 -65 -60 -55 

50 

Pro Gin His Asn Met Gly Leu Gin Leu Ser Val Val Thr Arg Asp Gly 
-50 -45 -40 



55 
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Val His Val His Pro Arg Ala Ala Gly Leu Val Gly Arg Asp Gly Pro 

-35 -30 -25 

Tyr Asp Lys Gin Pro Phe Met Val Ala Phe Phe Lys Val Ser Glu Val 
-20 -15 -10 

His Val Arg Thr Thr Arg Ser Ala Ser Ser Arg Arg Arg Gin Gin Ser 
-5 15 10 

Arg Asn Arg Ser Thr Gin Ser Gin Asp Val Ala Arg Val Ser Ser Ala 
15 20 25 

Ser Asp Tyr Asn Ser Ser Glu Leu Lys Thr Ala Cys Arg Lys His Glu 
30 35 40 

Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Gin Asp Trp lie lie Ala 
45 50 55 

Pro Lys Gly Tyr Ala Ala Asn Tyr cys Asp Gly Glu Cys Ser Phe Pro 
60 65 70 

Leu Asn Ala His Met Asn Ala Thr Asn His Ala lie Val Gin Thr Leu 
75 80 85 90 

Val His Leu Met Asn Pro Glu Tyr Val Pro Lys Pro Cys cys Ala Pro 
95 100 105 

Thr Lys Leu Asn Ala He Ser Val Leu Tyr Phe Asp Asp Asn Ser Asn 
110 H5 120 

Val He Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys 
125 130 135 

His 



(2) INFORMATION FOR SEQ ID NO:9: 
[0218] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(iit) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(H) CELL LINE: U2-OS osteosarcoma 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: U2-OS human osteosarcoma cDNA library 

(B) CLONE: U2-16 

(viii) POSITION IN GENOME: 
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(C) UNITS: bp 

(ix) FEATURE: 

5 (A) NAME/KEY: CDS 

(B) LOCATION: 699. .2063 

(ix) FEATURE: 

10 (A) NAME/KEY: mat_peptide 

(B) LOCATION: 1647. 2060 

(ix) FEATURE: 

15 (A) NAME/KEY: mRNA 

(B) LOCATION: 1..2153 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

20 
25 
30 
35 
40 
45 
50 
55 
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5 



15 



CTGGTATATT 


TGTGCCTGCT 


GGAGGTGGAA 


TTAACAGTAA 


GAAGGAGAAA 


GGGATTGAAT 


60 


GG ACTTACAG 


GAAGGATTTC 


AAGTAAATTC 


AGGGAAACAC 


ATTTACTTGA 


ATAGTACAAC 


120 


CTAGAGTATT 


ATTTTACACT 


AAGACGACAC 


AAAAGATGTT 


AAAGTTATCA 


CCAAGCTGCC 


180 


GG AC AG AT AT 


ATATTCCAAC 


ACCAAGGTGC 


AGATCAGCAT 


AGATCTGTGA 


TTCAGAAATC 


240 


AGGATTTGTT 


TTGGAAAGAG 


CTCAAGGGTT 


GAGAAGAACT 


CAAAAGCAAG 


TGAAGATTAC 


3O0 


TTTGGGAACT 


ACAGTTTATC 


AGAAGATCAA 


C TTTTG CTAA 


TTCAAATACC 


AAAGGCCTGA 


360 


TTATCATAAA 


TTCATATAGG 


AATGCATAGG 


TCATCTGATC 


AAATAATATT 


AGCCGTCTTC 


420 


TGCTACATCA 


ATG CAGC AAA 


AACTCTTAAC 


AACTGTGGAT 


AATTGGAAAT 


CTGAGTTTCA 


480 


GCTTTCTTAG 


AAATAACTAC 


TCTTGACATA 


TTCCAAAATA 


TTTAAAATAG 


GACAGGAAAA 


540 


TCGGTGAGGA 


TGTTGTGCTC 


AGAAATGTCA 


CTGTCATGAA 


AAATAGGTAA 


ATTTGTTTTT 


600 


TCAGCTACTG 


GGAAACTGTA 


CCTCCTAGAA 


CCTTAGGTTT 


TTTTTTTTTT 


AAGAGGACAA 


660 


GAAGGACTAA 


AAATATCAAC 


TTTTG CTTTT 


GGACAAAA ATG CAT CTG ACT GTA 


713 



Met His Leu Thr val 
-316-315 

TTT TTA CTT AAG GGT ATT GTG GGT TTC CTC TGC AGC TGC TGG GTT CTA 761 
Phe Leu Lea Lys Gly lie Val Gly Phe Leu Trp Ser Cys Trp Val Leu 
-310 -305 -300 

25 

GTG GGT TAT GCA AAA GGA GGT TTG GGA GAC AAT CAT GTT CAC TCC AGT S09 
Val Gly Tyr Ala Lys Gly Gly Leu Gly Asp Asn His Val His Ser Ser 
-295 -290 -285 -280 

TTT ATT TAT AGA AGA CTA CGG AAC CAC GAA AGA CGG GAA ATA CAA AGG 6 57 

30 Phe lie Tyr Arg Arg Leu Arg Asn His Glu Arg Arg Glu He Gin Arg 

-275 -270 -265 



GAA ATT CTC TCT ATC TTG 
Glu He Leu Ser He Leu 
-260 

35 

CCT GGA AAA ATG ACC AAT 
Pro Gly Lys Met Thr Asn 
-245 

GAT CTC TAC AAT GCC GAA 

40 



GGT TTG CCT CAC AGA CCC 
Gly Leu Pro His Arg Pro 
-255 

CAA GCG TCC TCT GCA CCT 
Gin Ala Ser Ser Ala pro 
-240 

GAA AAT CCT GAA GAG TCG 



AGA CCA TTT TCA 90 5 

Arg Pro Phe ser 
-250 

CTC TTT ATG CTG 95 3 

Leu Phe Met Leu 

-235 

GAG TAC TCA GTA 1001 



45 



50 



55 
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10 



15 



20 



25 



30 



Asp Leu Tyr Asn Ala Cilu Glu Asn Pro Glu Glu Ser Glu Tyr Ser Val 
-230 -225 -220 

AGG GCA TCC TTG GCA GAA GAG ACC AGA GGG GCA AG A AAG GGA TAC CCA 1049 
Arg Ala Ser Leu Ala Glu Glu Thr Arg Gly Ala Arg Lys Gly Tyr Pro 
-215 -210 -205 -200 

GCC TCT CCC AAT GGG TAT CCT CGT CGC ATA CAG TTA TCT CGG ACG ACT 10 97 

Ala Ser Pro Asn Gly Tyr Pro Arg Arg lie Gin Leu Ser Arg Thr Thr 
-195 -190 -185 

CCT CTG ACC ACC CAG AGT CCT CCT CTA GCC AGC CTC CAT GAT ACC AAC 114 5 

Pro Leu Thr Thr Gin Ser Pro Pro Leu Ala Ser Leu His Asp Thr Asn 
-180 -175 -170 

TTT CTG AAT GAT GCT GAC ATG GTC ATG AGC TTT GTC AAC TTA GTT GAA 119 3 

Phe Leu Asn Asp Ala Asp Met Val Met Ser Phe Val Asn Leu val Glu 

-165 -160 -155 

AGA GAC AAG GAT TTT TCT CAC CAG CGA AGG CAT TAC AAA GAA TTT CGA 12 41 

Arg Asp Lys Asp Phe Ser His Gin Arg Arg His Tyr Lys Glu Phe Arg 
-150 -145 -140 

TTT GAT CTT ACC CAA ATT CCT CAT GGA GAG GCA GTG ACA GCA GCT GAA 12 8 9 

Phe Asp Leu Thr Gin lie Pro His Gly Glu Ala Val Thr Ala Ala Glu 
-135 -130 -125 -120 

TTC CGG ATA TAC AAG GAC CGG AGC AAC AAC CGA TTT GAA AAT GAA ACA 133 7 

Phe Arg He Tyr Lys Asp Arg Ser Asn Asn Arg Phe Glu Asn Glu Thr 
-115 -110 -105 

ATT AAG ATT AGC ATA TAT CAA ATC ATC AAG GAA TAC ACA AAT AGG GAT 138 5 

He Lys He Ser He Tyr Gin He He Lys Glu Tyr Thr Asn Arg Asp 
-100 -95 ~9 0 

GCA GAT CTG TTC TTG TTA GAC ACA AGA AAG GCC CAA GCT TTA GAT GTG 14 3 3 

Ala Asp Leu Phe Leu Leu Asp Thr Arg Lys Ala Gin Ala Leu Asp Val 
-85 -80 -75 

GGT TGG CTT GTC TTT GAT ATC ACT GTG ACC AGC AAT CAT TGG GTG ATT 148 1 

Gly Trp Leu Val Phe Asp He Thr Val Thr Ser Asn His Trp Val He 
-70 -65 -60 

AAT CCC CAG AAT AAT TTG GGC TTA CAG CTC TGT GCA GAA ACA GGG GAT 152 9 

Asn Pro Gin Asn Asn Leu Gly Leu Gin Leu Cys Ala Glu Thr Gly Asp 
-55 -50 -45 -40 

GGA CGC AGT ATC AAC GTA AAA TCT GCT GGT CTT GTG GGA AGA CAG GGA 157 7 

Gly Arg Ser He Asn Val Lys Ser Ala Gly Leu Val Gly Arg Gin Gly 
-35 -30 -25 

CCT CAG TCA AAA CAA CCA TTC ATG GTG GCC TTC TTC AAG GCG AGT GAG 162 5 

Pro Gin Ser Lys Gin Pro Phe Met Val Ala Phe Phe Lys Ala Ser Glu 
-20 -15 -10 

GTA CTT CTT CGA TCC GTG AGA GCA GCC AAC AAA CGA AAA AAT CAA AAC 167 3 

Val Leu Leu Arg Ser Val Arg Ala Ala Asn Lys Arg Lys Asn Gin Asn 
-5 15 

CGC AAT AAA TCC AGC TCT CAT CAG GAC TCC TCC AGA ATG TCC AGT GTT 17 2 1 
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10 



15 



20 



30 



Arg Asn Lys Ser Ser Ser His Gin Asp Ser Ser Arg Met Ser Ser Val 
10 15 20 25 

GGA GAT TAT AAC ACA AGT GAG CAA AAA CAA GCC TGT AAG AAG CAC GAA 17 69 

Gly Asp Tyr Asn Thr Ser Glu Gin Lys Gin Ala Cys Lys Lys His Glu 
30 35 40 

CTC TAT GTG AGC TTC CGG GAT CTG GGA TGG CAG GAC TGG ATT ATA GCA 1817 
Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gin Asp Trp lie lie Ala 
45 50 55 

CCA GAA GGA TAC GCT GCA TTT TAT TGT GAT GGA GAA TGT TCT TTT CCA 18 65 

Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly Glu Cys Ser Phe Pro 
60 65 70 

CTT AAC GCC CAT ATG AAT GCC ACC AAC CAC GCT ATA GTT CAG ACT CTG 1913 
Leu Asn Ala His Met Asn Ala Thr Asn His Ala lie Val Gin Thr Leu 
75 80 85 

GTT CAT CTG ATG TTT CCT GAC CAC GTA CCA AAG CCT TGT TGT GCT CCA 1961 
Val His Leu Met Phe Pro Asp His Val Pro Lys Pro Cys Cys Ala Pro 
90 95 100 105 

ACC AAA TTA AAT GCC ATC TCT GTT CTG TAC TTT GAT GAC AGC TCC AAT 2 009 

Thr Lys Leu Asn Ala lie Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn 
110 115 120 

GTC ATT TTG AAA AAA TAT AGA AAT ATG GTA GTA CGC TCA TGT GGC TCC 2 057 

Val He Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ser Cys Gly Cys 
125 130 135 

CAC TAATATTAAA TAATATTGAT AATAACAAAA AGATCTGTAT TAAGGTTTAT 2110 
His 

GG CTG CAATA AAAAG CAT AC TTTCAGACAA ACAGAAAAAA AAA 215 3 



(2) INFORMATION FOR SEQ ID NO:10: 
35 [0219] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 
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Met His Leu Thr Val Phe Leu Leu Lys Gly lie Val Gly Phe Leu Trp 
-316 -315 "310 -305 

Ser Cvs Trp Val Leu Val Gly Tyr Ala Lys Gly Gly Leu Gly Asp Asn 
! 3 00 "295 "290 -285 

His Val His Ser Ser Phe lie Tyr Arg Arg Leu Arg Asn His Glu Arg 
-280 "275 -270 

Aro Glu He Gin Arg Glu He Leu Ser He Leu Gly Leu Pro His Arg 
y - 265 -260 "255 
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Pro Arg Pro Phe Ser Pro Gly Lys Met Thr Asn Gin Ala Ser Ser Ala 
-250 -245 -240 

Pro Leu Phe Met Leu Asp Leu Tyr Asn Ala Glu Glu Asn Pro Glu Glu 
-235 -230 -225 

Ser Glu Tyr Ser Val Arg Ala Ser Leu Ala Glu Glu Thr Arg Gly Ala 
-220 -215 -210 -205 

10 Arg Lys Gly Tyr Pro Ala Ser Pro Asn Gly Tyr Pro Arg Arg lie Gin 

-200 -195 -190 

Leu ser Arg Thr Thr Pro Leu Thr Thr Gin Ser Pro Pro Leu Ala Ser 
-185 -180 -175 

15 Leu His Asp Thr Asn Phe Leu Asn Asp Ala Asp Met Val Met Ser Phe 

-170 -165 -160 

Val Asn Leu Val Glu Arg Asp Lys Asp Phe Ser His Gin Arg Arg His 
-155 -150 -145 

20 Tyr Lys Glu Phe Arg Phe Asp Leu Thr Gin lie Pro His Gly Glu Ala 

-140 -135 -130 -125 

val Thr Ala Ala Glu Phe Arg He Tyr Lys Asp Arg Ser Asn Asn Arg 
-120 -115 -110 



25 



35 



40 



45 



50 



55 



Phe Glu Asn Glu Thr He Lys He Ser lie Tyr Gin He He Lys Glu 
-105 -100 -95 

Tyr Thr Asn Arg Asp Ala Asp Leu Phe Leu Leu Asp Thr Arg Lys Ala 
-90 -85 -80 

Gin Ala Leu Asp Val Gly Trp Leu Val Phe Asp He Thr Val Thr Ser 
-75 -70 -65 

Asn His Trp Val He Asn Pro Gin Asn Asn Leu Gly Leu GLn Leu Cys 
-60 -55 -50 -45 

Ala Glu Thr Gly Asp Gly Arg Ser He Asn Val Lys Ser Ala Gly Leu 
-40 -35 -30 

Val Gly Arg Gin Gly Pro Gin Ser Lys Gin Pro Phe Met val Ala Phe 
-25 -20 -15 

Phe Lys Ala ser Glu val Leu Leu Arg Ser Val Arg Ala Ala Asn Lys 
-10 -5 1 

Arg Lys Asn Gin Asn Arg Asn Lys Ser Ser Ser His Gin Asp Ser Ser 
5 10 15 20 

Arg Met Ser ser Val Gly Asp Tyr Asn Thr Ser Glu Gin Lys Gin Ala 
25 30 35 

Cys Lys Lys His Glu Leu Tyr Val ser Phe Arg Asp Leu Gly Trp Gin 
40 45 50 

Asp Trp He He Ala Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly 
55 60 65 
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Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
70 75 80 

Xle Val Gin Thr Leu val His Leu Met Phe Pro Asp His Val Pro Lys 
85 90 95 100 

Pro cys cys Ala Pro Thr Lys Leu Asn Ala He Ser Val Leu Tyr Phe 
— * no i* 5 



105 



Asp Asp Ser Ser Asn Val He Leu Lys Lys Tyr Arg Asn Met Val Val 



20 



25 



120 125 

Arg Ser Cys Gly Cys His 
135 

(2) INFORMATION FOR SEQ ID NO:11: 
[0220] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(F) TISSUE TYPE: Human Heart 

35 (vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Human heart cDNA library stratagene catalog #936208 

(B) CLONE: hH38 

40 (viii) POSITION IN GENOME: 

(C) UNITS: bp 
(ix) FEATURE: 

45 

(A) NAME/KEY: CDS 

(B) LOCATION: 8. .850 



50 



(ix) FEATURE: 

(A) NAME/KEY: maLpeptide 

(B) LOCATION: 427.. 843 

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 1..997 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11: 



GAATTCC GAG CCC CAT TGG AAG GAG TTC CGC TTT GAC CTG ACC CAG ATC 

Glu Pro His Trp Lys Glu Phe Arg Phe Asp Leu Thr Gin lie 
-139 -135 -130 
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CCG GCT GGG GAG GCG GTC ACA GCT GCG GAG TTC CGG ATT TAC AAG GTG 97 
Pro Ala Gly Glu Ala Val Thr Ala Ala Glu Phe Arg lie Tyr Lys Val 
-125 -120 -115 -110 

CCC AGC ATC CAC CTG CTC AAC AGG ACC CTC CAC GTC AGC ATG TTC CAG 14 5 

Pro Ser He His Leu Leu Asn Arg Thr Leu His val Ser Met Phe Gin 
-105 -100 - -95 

GTG GTC CAG GAG CAG TCC AAC AGG GAG TCT GAC TTG TTC TTT TTG GAT 19 3 

Val Val Gin Glu Gin Ser Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp 
-90 -85 -80 

CTT CAG ACG CTC CGA GCT GGA GAC GAG GGC TGG CTG GTG CTG GAT GTC 241 

Leu Gin Thr Leu Arg Ala Gly Asp Glu Gly Trp Leu Val Leu Asp Val 
-75 -70 -65 

ACA GCA GCC AGT GAC TGC TGG TTG CTG AAG CGT CAC AAG GAC CTG GGA 289 
Thr Ala Ala Ser Asp Cys Trp Leu Leu Lys Arg His Lys Asp Leu Gly 
-60 -55 -50 

CTC CGC CTC TAT GTG GAG ACT GAG GAT GGG CAC AGC GTG GAT CCT GGC 3 37 

Leu Arg Leu Tyr Val Glu Thr Glu Asp Gly His Ser Val Asp Pro Gly 
-45 -40 -35 -30 

CTG GCC GGC CTG CTG GGT CAA CGG GCC CCA CGC TCC CAA CAG CCT TTC 3 85 

■ Leu Ala Gly Leu Leu Gly Gin Arg Ala Pro Arg Ser Gin Gin Pro Phe 
-25 -20 -15 

GTG GTC ACT TTC TTC AGG GCC AGT CCG AGT CCC ATC CGC ACC CCT CGG 4 33 

Val Val Thr Phe Phe Arg Ala Ser Pro Ser Pro He Arg Thr Pro Arg 
-10 -5 1 

GCA GTG AGG CCA CTG AGG AGG AGG CAG QCG AAG AAA AGC AAC GAG CTG 481 
Ala Val Arg Pro Leu Arg Arg Arg Gin Pro Lys Lys Ser Asn Glu Leu 
5 10 15 

CCG CAG GCC AAC CGA CTC CCA GGG ATC TTT GAT GAC GTC CAC GGC TCC 52 9 

Pro Gin Ala Asn Arg Leu Pro Gly He Phe Asp Asp Val His Gly Ser 
20 25 30 35 

CAC GGC CGG CAG GTC TGC CGT CGG CAC GAG CTC TAC GTC AGC TTC CAG 577 
35 his Gly Arg Gin Val cys Arg Arg His Glu Leu Tyr Val ser Phe Gin 

40 45 50 

GAC CTT GGC TGG CTG GAC TGG GTC ATC GCC CCC CAA GGC TAC TCA GCC 62 5 

Asp Leu Gly Trp Leu Asp Trp Val lie Ala Pro Gin Gly Tyr Ser Ala 
55 60 65 

40 

TAT TAC TGT GAG GGG GAG TGC TCC TTC CCG CTG GAC TCC TGC ATG AAC 67 3 

Tyr Tyr Cys Glu Gly Glu Cys Ser Phe Pro Leu Asp Ser Cys Met Asn 

70 75 80 

GCC ACC AAC CAC GCC ATC CTG CAG TCC CTG GTG CAC CTG ATG AAG CCA 721 
45 Ala Thr Asn His Ala lie Leu Gin Ser Leu Val His Leu Met Lys Pro 

85 90 95 

AAC GCA GTC CCC AAG GCG TGC TGT GCA CCC ACC AAG CTG AGC GCC ACC 7 69 

Asn Ala Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr 

100 105 110 115 

50 
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25 
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TCT GTG CTC TAC TAT GAC AGC AGC AAC AAC GTC ATC CTG CGC AAG CAC 817 
Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val He Leu Arg Lys His 
120 125 130 

5 CGC AAC ATG GTG GTC AAG GCC TGC GGC TGC CAC TGAGTCAGCC CGCCCAGCCC 8 70 

Arg Asn Met Val Val Lys Ala Cys Gly Cys His 
135 140 

TACTGCAGCC ACCCTTCTCA TCTGGATCGG GCCCTGCAGA GGCAGAAAAC CCTTAAATGC 930 

10 TGTCACAGCT CAAGCAGGAG TGTCAGGGGC CCTCACTCTC GGTG CCTACT TCCTGTCAGG 990 

CTTCTGGGAA TTC 1003 



(2) INFORMATION FOR SEQ ID NO:12: 

15 

[0221] 

(i) SEQUENCE CHARACTERISTICS: 

20 (A) LENGTH: 281 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

25 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
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Glu Pro His Trp Lys Glu Phe Arg Phe Asp Leu Thr Gin lie Pro Ala 
139 -135 -130 _ 125 

Gly Glu Ala Val Thr Ala Ala Glu Phe Arg lie Tyr Lys Val Pro Ser 
" 120 -115 .no 

lie His Leu Leu Asn Arg Thr Leu His Val Ser Met Phe Gin Val Val 
-105 -100 

Gin Glu Gin Ser Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp Leu Gin 
_9 ° -85 -80 

Thr Leu Arg Ala Gly Asp Glu Gly Trp Leu Val Leu Asp Val Jhr Ala 
~ 75 - 70 "65 -so 

Ala Ser Asp Cys Trp Leu Leu Lys Arg His Lys Asp Leu Gly Leu Aro 
"55 -50 _ 45 

Leu Tyr Val Glu Thr Glu Asp Gly His Ser Val Asp Pro Gly Leu Ala 
-40 -35 -30 

Gly Leu Leu Gly Gin Arg Ala Pro Arg Ser Gin Gin Pro Phe Val Val 
-25 -20 -15 

Thr Phe Phe Arg Ala Ser Pro Ser Pro He Arg Thr Pro Arg Ala Val 
~ 10 ~ 5 1 5 

Arg Pro Leu Arg Arg Arg Gin Pro Lys Lys Ser Asn Glu Leu Pro Gin 

10 15 20 

Ala Asn Arg Leu Pro Gly He Phe Asp Asp val His Gly Ser His Gly 
25 30 35 

Arg Gin Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gin Asp Leu 



40 45 50 



Gly 


Trp 
55 


Leu 


Asp 


Trp 


Val 


He 
60 


Ala 


Pro 


Gin 


Gly 


Tyr 
65 


Ser 


Ala 


Tyr 


Tyr 


Cys 
70 


Glu 


Gly 


Glu 


Cys 


Ser 
75 


Phe 


Pro 


Leu 


Asp 


Ser 
•80 


Cys 


Met 


Asn 


Ala 


Thr 
85 


Asn 


His 


Ala 


He 


Leu 
90 


Gin 


Ser 


Leu 


Val 


His 
95 


Leu 


Met 


Lys 


Pro 


Asn 
100 


Ala 


Val 


Pro 


Lys 


Ala 
105 


Cys 


Cys 


Ala 


Pro 


Thr 
110 


Lys 


Leu 


Ser 


Ala 


Thr 
115 


Ser 


Val 


Leu 


Tyr 


Tyr 
120 


Asp 


Ser 


Ser 


Asn 


Asn 
125 


Val 


He 


Leu 


Arg 


Lys His 
130 


Arg 


Asn 


Met 


Val 
135 


Val 


Lys 


Ala 


Cys 


Gly 
140 


Cys 


His 
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(2) INFORMATION FOR SEQ ID NO:1 3: 
[0222] 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2623 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
io (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

15 

(B) CLONE: pALBP2-781 

(ix) FEATURE: 

20 (A) NAME/KEY: CDS 

(B) LOCATION: 2724.. 3071 

(ix) FEATURE: 

25 (A) NAME/KEY: terminator 

(B) LOCATION: 31 50.. 321 8 

(ix) FEATURE: 

30 (A) NAME/KEY: RBS 

(B) LOCATION: 2222.. 2723 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

GACGAAAGGC CCTCGTGATA CGCCTATTTT TATAGGTTAA TGTCATCATA ATAATGGTTT 60 

CTTAGACGTC AGGTGGCACT TTTCGGGGAA ATGTGCGCGG AACCCCTATT TGTTTATTTT 12 0 

TCTAAATACA TTCAAATATG TATCCGCTCA TGAGACAATA ACCCTGATAA ATGCTTCAAT 180 

AATATTGAAA AAGGAAGAGT ATGAGTATTC AACATTTCCG TGTCG CCCTT ATTCCCTTTT 24 0 

TTGCGGCATT TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 3 00 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT CGATCTCAAC AGCGGTAAGA 3 60 
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TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT GAG CACTTTT AAAGTTCTGC 
TATGTGGCGC GGTATTATCC CGTATTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC 
ACTATTCTCA GAATGACTTG GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG 
GCATGACAGT AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 
ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG CACAACATGG 660 
GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT GAATGAAGCC ATACCAAACG 720 
ACGAGCGTGA CACCACGATG CCTGTAGCAA TGGCAACAAC GTTGCGCAAA CTATTAACTG 
GCGAACTACT TACTCTAGCT TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG 
TTGCAGGACC A CTTCTG CG C TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 900 
GAGCCGGTGA G CGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT GGTAAGCCCT 960 
CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC TATGGATGAA CGAAATAGAC 
AGATCGCTGA GATAGGTGCC TCACTGATTA AGCATTGGTA ACTGTCAG A C CAAGTTTACT 
CATATATACT TTAGATTGAT TTAAAACTTC ATTTTTAATT TAAAAGGATC TAGGTGAAGA 
TCCTTTTTGA TAATCTCATG ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT 
CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT 
GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC 
TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTCC 
30 TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC 

TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG 1500 
GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGG G TT X560 
35 CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG 

AGCATTGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG 
GCAGGGTCGG AACAGGAGAG CG CACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT 
4Q ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG 

GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT 
GCTGGCCTTT TGCTCACATG TTCTTTCCTG CGTTATCCCC TGATTCTGTG GATAACCGTA 
^ TTACCGCCTT TGAGTGAGCT GATACCGCTC GCCGCAGCCG AACGACCGAG CGCAGCGAGT 

CAGTGAGCGA GGAAGCGGAA GAGCGCCCAA TACGCAAACC GCCTCTCCCC GCGCGTTGGC 
CGATTCATTA ATGCAGAATT GATCTCTCAC CTACCAAACA ATGCCCCCCT GCAAAAAATA 



20 



25 



420 
480 
540 
600 



780 
840 



1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 



1620 

1680 

1740 

1800 

I860 

1920 

1980 

2040 

2100 



^ AATTCATATA AAAAACATAC AGATAACCAT CTGCGGTGAT AAATTATCTC TCGCGGTGTT 2160 
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10 



15 



20 



25 



GACATAAATA 


CCACTGGCGG 

v* Vp» n j. \j \j \j \J 


TGATACTGAG 


CACATCAGCA 


GGACGCACTG 


ACCACCATGA 


o ~% *i f\ 
2220 


AGGTGACGCT 


CTTAAAAATT 


AAGCCCTGAA 


GAAGGGCAGC 


ATTCAAAGCA 


GAAGGCTTTG 


£. £■ O \J 


GGGTGTGTGA 


TACGAAACGA 


AGCATTGGCC 


GTAAGTGCGA 


TTCCGGATTA 


GCTGCCAATG 




TGCCAATCGC 


GGGGG GTTTT 


CGTTCAGGAC 


TACAACTGCC 


ACACACCACC 


AAAGCTAACT 


2400 


GACAGGAGAA 


TCCAGATGGA 


TGCACAAACA 


CGCCGCCGCG 


AACGTCGCGC 


AGAGAAACAG 


2460 


GCTCAATGGA 


AAG C AG C AAA 


TCCCCTGTTG 


GTTGGGGTAA 


GCGCAAAACC 


AGTTCCGAAA 


2520 


GATTTTTTTA 


ACTATAAACG 


CTGATGGAAG 


CGTTTATGCG 


GAAGAGGTAA 


AGCCCTTCCC 


2580 


GAGTAACAAA 


AAAACAACAG 


CATAAATAAC 


CCCGCTCTTA 


CACATTCCAG 


CCCTGAAAAA 


2640 


GGGCATCAAA 


TTAAACCACA 


CCTATGGTGT 


ATGCATTTAT 


TTGCATACAT 


TCAATCAATT 


2700 


GTTATCTAAG 


G AAA T A CTT A 


CAT ATG CAA GCT AAA CAT AAA CAA CGT AAA 


2750 



Met Gin Ala Lys His Lys Gin Arg Lys 
1 5 

CGT CTG AAA TCT AGC TGT AAG AGA CAC CCT TTG TAC GTG GAC TTC AGT 2798 
Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp Phe Ser 
10 15 20 25 

GAC GTG GGG TGG AAT GAC TGG ATT GTG GCT CCC CCG GGG TAT CAC GCC 28 4 6 

Asp Val Gly Trp Asn Asp Trp lie Val Ala Pro Pro Gly Tyr His Ala 
30 35 40 

TTT TAC TGC CAC GGA GAA TGC CCT TTT CCT CTG GCT GAT CAT CTG AAC 2894 
Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His Leu Asn 
45 50 55 

30 TCC ACT AAT CAT GCC ATT GTT CAG ACG TTG GTC AAC TCT GTT AAC TCT 294 2 

Ser Thr Asn His Ala lie Val Gin Thr Leu Val Asn ser Val Asn Ser 

60 65 70 

AAG ATT CCT AAG GCA TGC TGT GTC CCG ACA GAA CTC AGT GCT &TC TCG 2990 
Lys lie Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala lie Ser 
35 75 80 85 

ATG CTG TAC CTT GAC GAG AAT GAA AAG GTT GTA TTA AAG AAC TAT CAG 3038 
Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn Tyr Gin 
90 95 100 105 

40 GAC ATG GTT GTG GAG GGT TGT GGG TGT CGC TAGTACAGCA AAATTAAATA 3088 

Asp Met val Val Glu Gly Cys Gly Cys Arg 
110 115 



50 



CATAAATATA 


TATATATATA 


TATATTTTAG 


AAAAAAGAAA 


AAAATCTAGA 


GTCGACCTGC 


3148 


AGTAATCGTA 


CAGGGTAGTA 


CAAATAAAAA 


AGGCACGTCA 


GATGACGTGC 


CTTTTTTCTT 


3208 


GTGAGCAGTA 


AGCTTGGCAC 


TGGCCGTCGT 


TTTACAACGT 


CGTGACTGGG 


AAAACCCTGG 


3268 


CGTTACCCAA 


CTTAATCGCC 


TTG C AGC AC A 


TCCCCCTTTC 


GCCAGCTGGC 


GTAATAG CG A 


3328 


AGAGGCCCGC 


ACCGATCGCC 


CTTCCCAACA 


GTTGCGCAGC 


CTGAATGGCG 


AATGGCGCCT 


3388 



61 



EP 0 612 348 B1 



10 



20 



40 



GATGCGGTAT TTTCTCCTTA CGCATCTGTG CGGTATTTCA CACCGCATAT ATGGTGCACT 3448 
CTCAGTACAA TCTGCTCTGA TGCCGCATAG TTAAGCCAGC CCCGACACCC GCCAACACCC 
GCTGACGCGC CCTGACGGGC TTGTCTGCTC CCGGCATCCG CTTACAGACA AGCTGTGACC 

CGCGA 3 62 3 



3508 

GTCTCCGGGA GCTGCATGTG TCAGAGGTTT TCACCGTCAT CACCGAAACG 



(2) INFORMATION FOR SEQ ID NO:14: 
[0223] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 



Met Gin Ala Lys His Lys cm Arg Lys Arg Leu Lys Ser Ser cys Lys 

10 15 

Arg His Pro Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp Trp 

25 30 

lie val Ala Pro Pro Gly Tyr H is Ala Phe Tyr Cys His Gly Glu Cys 

4 0 45 

Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala He Val 

• 5S 60 
Gin Thr Leu Val Asn Ser Val AS n Ser Lys He Pro Lys Ala Cys Cys 

75 80 

val Pro Thr Glu Leu ser. Ala He Ser Met Leu Tyr Leu Asp Glu Asn 
85 90 95 

Glu Lys Val Val Leu Lys Asn Tyr Gin Asp «et Val val Glu Gly Cy s 

105 110 

Gly Cys Arg 
115 

45 (2) INFORMATION FOR SEQ ID N0:15: 
[0224] 

(i) SEQUENCE CHARACTERISTICS' 

so 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

55 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 
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CATGGGCAGC TGAG 

(2) INFORMATION FOR SEQ ID NO:16: 
[0225] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genonic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

GAGGGTTGTG GGTGTCGCTA GTGAGTCGAC TACAGCAAAT 

(2) INFORMATION FOR SEQ ID NO:17: 
[0226] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

GGATGTGGGT GCCGCTGACT CTAGAGTCGA CGGAATTC 

(2) INFORMATION FOR SEQ ID NO:18: 
[0227] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

AATTCACCAT GATTCCTGGT AACCGAATGC T 
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(2) INFORMATION FOR SEQ ID NO:19: 
[0228] 

(t) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 



GTGGTACTAA GGACCATTGG CTTAC 

25 

(2) INFORMATION FOR SEQ ID NO:20: 
[0229] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 



35 CGACCTGCAG CCATGCATCT GACTGTA 

(2) INFORMATION FOR SEQ ID NO:21: 
[0230] 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

45 (C) STRANDEDN ESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

w (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21 : 



TGCCTGCAGT TTAATATTAG TGGCAGC 

27 
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(2) INFORMATION FOR SEQ ID NO:22: 
[0231] 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

15 

CGACCTGCAG CCACC 

(2) INFORMATION FOR SEQ ID NO:23: 

20 

[0232] 

(i) SEQUENCE CHARACTERISTICS: 

25 (A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

35 TCGACCCACC ATGCCGGGGC TGGGGCGGAG GGCGCAGTGG CTGTG CTGGT GGTGGGGGCT 

GTGCTGCAGC TGCTGCGGGC C 

(2) INFORMATION FOR SEQ ID NO:24: 

40 

[0233] 

(i) SEQUENCE CHARACTERISTICS: 

45 (A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

so (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

55 CGCAGCAGCT GCACAGCAGC CCCCACCACC AGCACAGCCA CTGCGCCCTC CGCCCCAGCC 

CCGGCATGGT GGG 
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(2) INFORMATION FOR SEQ ID NO:25: 
[0234] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 



TCGACTGGTT T 

(2) INFORMATION FOR SEQ ID NO:26: 
[0235] 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 



35 CGAAACCAG 

(2) INFORMATION FOR SEQ ID NO:27: 
[0236] 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 



TCGACAGGCT CGCCTGCA 
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(2) INFORMATION FOR SEQ ID NO:28: 
[0237] 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
w (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 



GTCCGAG CGG 

(2) INFORMATION FOR SEQ ID NO:29: 

20 

[0238] 

(i) SEQUENCE CHARACTERISTICS: 

25 (A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

CAGCTCGACC CACCATGCAC GTGCGCTCA 

35 

(2) INFORMATION FOR SEQ ID NO:30: 
[0239] 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

45 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 

TCTGTCGACC TCGGAGGAGC TAGTGGC 
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Claims 



2. 



A method for producing a heterodimeric protein having bone stimulating activity comprising culturing a selected 
host cell containing a nucleotide sequence encoding a first selected BMP or fragment thereof and a nucleotide 
sequence encoding a second selected BMP or fragment thereof , said nucleotide sequences each being under the 
control of a suitable regulatory sequence capable of directing co-expression of said proteins, and isolating said 
heterodimenc protein from the culture medium, wherein said heterodimeric protein is a human BMP-2/5 BMP-2/6 
BMP-4/5, BMP-4/6 or BMP-4/7 heterodimer. 

The method of claim 1 , wherein said nucleotide sequences are present on individual vectors transfected into said 
host cell. 

3. The method of claim 2, wherein more than a single copy of the gene encoding each said BMP or fragment thereof 
is present on each vector. 

4. The method of claim 1 , wherein both said nucleotide sequences are present on a single vector. 

5. The method of claim 1 , wherein both said nucleotide sequences are incorporated into a chromosome of said host 
cell. 

6. The method of any one of claims 1 to 3, wherein said host cell is a hybrid cell prepared by culturing two fused 
selected, stable host cells, each host cell transfected with a nucleotide sequence encoding a selected first or 
second BMP or fragment thereof, said nucleotide sequences under the control of a suitable regulatory sequence 
capable of directing expression of each protein or fragment. 

7. The method of any one of claims 1 to 6, wherein said host cell is a mammalian cell, an insect cell or a yeast cell. 

8. A method for producing a heterodimeric protein having bone stimulating activity in a bacterial cell comprising 
culturing a selected host cell containing a nucleotide sequence encoding a first selected BMP or fragment thereof 
under the control of a suitable regulatory sequence capable of directing expression of the protein or protein frag- 
ment under conditions suitable for the formation of soluble, monomeric protein; culturing a selected host cell con- 
taining a nucleotide sequence encoding a second BMP or fragment thereof underthe control of a suitable regulatory 
sequence capable of directing expression-of the protein or protein fragment under said conditions to form a second 
soluble, monomeric protein; and mixing said soluble monomeric proteins under conditions permitting the formation 
of dimeric proteins associated by at least one covalent disulfide bond; and isolating from the mixture a heterodimeric 
protein, wherein said heterodimeric protein is a human BMP-2/5, BMP-2/6, BMP-4/5, BMP-4/6 or BMP-4/7 het- 
erodimer. 

9. The method of claim 8, wherein said host cell is E. coli. 

10. The method of claim 8 or 9, wherein said conditions comprise treating said protein with a solubilizing agent 

11. A cell line comprising a nucleotide sequence encoding a first BMP or fragment thereof under the control of a 
suitable expression regulatory system and a nucleotide sequence encoding a second BMP or fragment thereof 
under the control of a suitable expression regulatory system, said regulatory systems capable of directing the co- 
expression of said BMPs or fragments thereof and the formation of heterodimeric protein, wherein said recombinant 
heterodimeric protein is a human BMP-2/5, BMP-2/6, BMP-4/5, BMP-4/6 or BMP-4/7 heterodimer. 

12. The cell line of claim 11 , wherein said nucleotide sequences are present in a single DNA molecule. 

13. The cell line of claim 11, wherein said nucleotide sequences are present on different DNA molecules. 

14. The cell line of claim 12, wherein said single DNA molecule comprises a first transcription unit containing a gene 
encoding a first BMP or fragment thereof and a second transcription unit containing a gene encoding a second 
BMP or fragment thereof. 



15. 



The cell line of claim 1 2, wherein said single DNA molecule comprises a single transcription unit containing multiple 
copies of said gene encoding said first BMP or fragments thereof and multiple copies of said gene encoding said 
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second BMP or fragments thereof. 

16. A DNA molecule comprising a nucleotide sequence encoding a first selected BMP or fragment thereof and a 
nucleotide sequence encoding a second, different selected BMP or fragment thereof, said nucleotide sequences 
being under the control of at least one suitable regulatory sequence capable of directing co-expression of each 
BMP or fragment thereof, wherein said first selected BMP is BMP-2 or BMP-4, and said second selected BMP is 
BMP-5 or BMP-6, or wherein said first selected BMP is BMP-4 and said second selected BMP is BMP-7. 

17. The molecule of claim 16 comprising a first transcription unit containing a gene encoding a first BMP or fragment 
thereof and a second transcription unit containing a gene encoding a second BMP or fragment thereof. 

18. The molecule of claim 16 comprising a single transcription unit containing multiple copies of said gene encoding 
said first BMP or fragments thereof and multiple copies of said gene encoding said second BMP or fragments 
thereof. 

19. A recombinant heterodimeric protein having bone stimulating activity comprising a protein or fragment of a first 
BMP in association with a second protein or fragment of a second BMP produced by co-expressing said proteins 
in a selected host cell, wherein said recombinant heterodimeric protein is a human BMP-2/5, BMP-2/6, BMP-4/5, 
BMP-4/6 or BMP-4/7 heterodimer. 

20. A pharmaceutical composition comprising a protein of claim 19. 

21 . Use of a recombinant heterodimeric protein of claim 1 9 for the production of a pharmaceutical composition for the 
treatment of bone defects, periodontal diseases, healing bone injury, wound healing or increasing neuronal survival 
or for surgery. 



Patentanspruche 

1 . Verfahren zur Herstellung eines heterodimeren Proteins, das knochenstimulierende Aktivitat hat, umfassend Zuch- 
ten einer ausgewahlten Wirtszelle, die eine Nucleotidsequenz enthalt, die ein erstes ausgewahltes BMP oder ein 
Fragment davon codiert, und eine Nucleotidsequenz, die ein zweites ausgewahltes BMP oder ein Fragment davon 
codiert, wobei die Nucleotidsequenzen jeweils unter der Kontroile einer geeigneten regulatorischen Sequenz sind, 
die imstande ist, die Co-Expression der Proteine zu regeln, und Isolieren des heterodimeren Proteins aus dem 
Kulturmedium, wobei das heterodimere Protein ein menschiiches BMP-2/5-, BMP-2/6-, BMP-4/5-, BMP-4/6- oder 
BMP-4/7- Heterodimer ist. 

2. Verfahren nach Anspruch 1 , wobei die Nucleotidsequenzen auf einzelnen in die Wirtszelle transfizierten Vektoren 
vorhanden sind. 

3. Verfahren nach Anspruch 2, wobei mehr als eine einzelne Kopie des Gens, das jeweils das BMP oder ein Fragment 
davon codiert, auf jedem Vektor vorhanden ist. 

4. Verfahren nach Anspruch 1 , wobei beide Nucleotidsequenzen auf einem einzelnen Vektor vorhanden sind. 

5. Verfahren nach Anspruch 1 , wobei beide Nucleotidsequenzen in ein Chromosom der Wirtszelle eingebaut werden. 

6. Verfahren nach einem der Anspruche 1 bis 3, wobei die Wirtszelle eine Hybridzelle ist, hergestellt durch Zuchten 
von zwei fusionierten, ausgewahlten, stabilen Wirtszellen, wobei'jede Wirtszelle mit einer Nucleotidsequenz trans- 
fiziert ist, die ein ausgewahltes erstes oder zweites BMP oder ein Fragment davon codiert, wobei die Nucleotid- 
sequenzen unter der Kontroile einer geeigneten regulatorischen Sequenz sind, die imstande ist, die Expression 
von jedem Protein oder Fragment zu regeln. 

7. Verfahren nach einem der Anspruche 1 bis 6, wobei die Wirtszelle eine Saugerzelle, eine Insektenzelle oder eine 
Hefezelle ist. 

8. Verfahren zur Herstellung eines heterodimeren Proteins, das knochenstimulierende Aktivitat hat, in einer Bakte- 
rienzelle, umfassend Zuchten einer ausgewahlten Wirtszelle, die eine Nucleotidsequenz enthalt, die ein erstes 
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ausgewahltes BMP oder ein Fragment davon codiert, unter der Kontrolle einer geeigneten regulatorischen Se- 
quenz, die imstande ist, die Expression des Proteins oder des Proteinfragmentes unter Bedingungen zu regeln, 
die fur die Bildung eines loslichen, monomeren Proteins geeignet sind; Zuchten einer ausgewahiten Wirtszelle, 
die eine Nucleotidsequenz enthalt, die ein zweites BM P oder ein Fragment davon codiert, unter der Kontrolle einer 
geeigneten regulatorischen Sequenz, die imstande ist, die Expression des Proteins oder des Proteinfragmentes 
unter den Bedingungen zu regeln, urn ein zweites losliches, monomeres Protein zu bilden; und Mischen der los- 
lichen, monomeren Proteine unter Bedingungen, die die Bildurig von dimeren Proteinen erlauben, die durch min- 
destens eine kovatente Disulfidbindung verbunden sind; und Isolieren eines heterodimeren Proteins aus dem Ge- 
misch, wobei das heterodimere Protein ein menschliches BMP-2/5-, BMP-2/6-, BMP-4/5-, BMP-4/6- oder BMP- 
4/7-Heterodimer ist. 

9. Verfahren nach Anspruch 8, wobei die Wirtszelle E. coli ist. 

10. Verfahren nach Anspruch 8 oder 9, wobei die Bedingungen das Behandeln des Proteins mit einem Solubilisie- 
rungsmittel umfassen. 

1 1 . Zellinie, umfassend eine Nucleotidsequenz, die ein erstes BMP oder ein Fragment davon codiert, unter der Kon- 
trolle eines geeigneten expressions-regulatorischen Systems, und eine Nucleotidsequenz, die ein zweites BMP 
oder ein Fragment davon codiert, unter der Kontrolle eines geeigneten expressions-regulatorischen Systems, wo- 
bei das regulatorische System imstande ist, die Co-Expression der BMPs oder der Fragmente davon und die 
Bildung von heterodimerem Protein zu regeln, wobei das rekombinante, heterodimere Protein ein menschliches 
BMP-2/5-, BMP-2/6-, BMP-4/5-, BMP-4/6- oder BMP-4/7-Heterodimer ist. 

12. Zellinie nach Anspruch 11 , wobei die Nudeotidsequenzen in einem einzelnen DNA-Molekul vorhanden sind. 

13. Zellinie nach Anspruch 11, wobei die Nudeotidsequenzen auf unterschiedlichen DNA-Molekulen vorhanden sind. 

14. Zellinie nach Anspruch 12, wobei das einzelne DNA-Molekul eine erste Transkriptionseinheit umfasst, enthaltend 
ein Gen, das ein erstes BMP oder ein Fragment davon codiert, und eine zweite Transkriptionseinheit, enthaitend 
ein Gen, das ein zweites BMP oder ein Fragment davon codiert. 

15. Zellinie nach Anspruch 12, wobei das einzelne DNA-Molekul eine einzelne Transkriptionseinheit umfasst, enthal- 
tend mehrere Kopien des Gens, das das erste BMP oder Fragmente davon codiert, und mehrere Kopien des Gens, 
das das zweite BMP oder Fragmente davon codiert. 

16. DNA-Molekul, umfassend eine Nucleotidsequenz, die ein erstes ausgewahltes BMP oder ein Fragment davon 
codiert, und eine Nucleotidsequenz, die ein zweites, unterschiedliches, ausgewahltes BMP oder ein Fragment 
davon codiert, wobei die Nudeotidsequenzen unter der Kontrolle von mindestens einer geeigneten reguluatori- 
schen Sequenz sind, die imstande ist, die Co-Expression von jedem BMP oder Fragment davon zu regeln, wobei 
das erste ausgewahlte BMP BMP-2 oder BMP-4 ist, und das zweite ausgewahlte BMP BMP-5 oder BMP-6 ist, 
oder wobei das erste ausgewahlte BMP BMP-4 ist und das zweite ausgewahlte BMP BMP-7 ist. 

17. Molekul nach Anspruch 16, umfassend eine erste Transkriptionseinheit, die ein Gen enthalt, das ein erstes BMP 
oder ein Fragment davon codiert, und eine zweite Transkriptionseinheit, die ein Gen enthalt, das ein zweites BMP 
oder ein Fragment davon codiert. 

18. Molekul nach Anspruch 16, umfassend eine einzelne Transkriptionseinheit, diemehrfache Kopien des Gens ent- 
halt, das das erste BMP oder Fragmente davon codiert, und mehrfache Kopien des Gens, das das zweite BMP 
oder Fragmente davon codiert. 

19. Rekombinantes, heterodimeres Protein mit knochenstimulierender Aktivitat, umfassend ein Protein oder ein Frag- 
ment eines ersten BMP in Verbindung mit einem zweiten Protein oder Fragment eines zweiten BMP, hergestellt 
durch Co-Exprimieren der Proteine in einer ausgewahiten Wirtszelle, wobei das rekombinante, heterodimere Pro- 
tein ein menschliches BMP-2/5-, BMP-2/6-, BMP-4/5-, BMP-4/6- oder BMP-4/7-Heterodimer ist. 

20. Arzneimittel, umfassend ein Protein nach Anspruch 19. 

21. Verwendung eines rekombinanten, heterodimeren Proteins nach Anspruch 19 fur die Hersteliung eines Arznei- 
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mittelszurBehandlung von Knochendefekten, Periodontalerkrankungen, Heilung von Knochenverletzung, Wund- 
heilung oder Steigerung neuronalen Uberlebens Oder zur Chirurgte. 

Revendications 

1. Methode pour produire une proteine heterodimerique ayant une activite de stimulation osseuse, comprenant les 
etapes consistant a cultiver une cellule note selectionnee contenant une sequence nucleotidique codant une pre- 
miere BMP selectionnee ou I'un de ses fragments et une sequence nucleotidique codant une seconde BMP se- 
lectionnee ou I'un de ses fragments, lesdites sequences nucleotidiques etant chacune sous le controle d'une 
sequence regulatrice appropriee capable de dinger la co-expression desdites proteines, et a isoler ladite proteine 
heterodimerique a partir du milieu de culture, ladite proteine heterodimerique etant un heterodimere BMP-2/5, 
BMP-2/6, BMP-4/5, BMP-4/6 ou BMP-4/7 humain. 

2. Methode de la revendication 1 , dans laquelle lesdites sequences nucleotidiques sont presentes sur des vecteurs 
individuels transfectes dans ladite cellule note. 

3. Methode de la revendication 2, dans laquelle plus d'une seule copie du gene codant chacune desdites BMP ou 
I'un de leurs fragments sont presentes sur chaque vecteur. 

4. Methode de la revendication 1 , dans laquelle les deux dites sequences nucleotidiques sont presentes sur un seul 
vecteur. 

5. Methode de la revendication 1 , dans laquelle les deux dites sequences nucleotidiques sont incorporees dans un 
chromosome de ladite cellule hote. 

6. Methode selon Tune quelconque des revendications 1 a 3, dans laquelle ladite cellule hote est une cellule hybride 
preparee en cultivant deux cellules notes stables selectionnees fusionnees, chaque cellule hote transfectee par 
une sequence nucleotidique codant une premiere ou une seconde BMP selectionnee ou I'un de leurs fragments, 
lesdites sequences nucleotidiques etant sous le controle d'une sequence regulatrice appropriee capable de dinger 
I'expression de chaque proteine ou fragment. 

7. Methode de Tune quelconque des revendications 1 a 6, dans laquelle ladite cellule hote est une cellule de mam- 
mifere, une cellule d'insecte ou une cellule de levure. 

8. Methode pour produire une proteine heterodimerique ayant une activite de stimulation osseuse dans une cellule 
bacterienne, comprenant les etapes consistant a cultiver une cellule hote selectionnee contenant une sequence 
nucleotidique codant une premiere BMP selectionnee ou I'un de ses fragments sous le controle d'une sequence 
regulatrice appropriee capable de diriger I'expression de la proteine ou du fragment de proteine dans des conditions 
appropriees a la formation d'une proteine monomerique soluble ; a cultiver une cellule hote selectionnee contenant 
une sequence nucleotidique codant une seconde BMP selectionnee ou I'un de ses fragments sous le controle 
d'une sequence regulatrice appropriee capable de diriger I'expression de la proteine ou du fragment de proteine 
dans lesdites conditions pourformer une seconde proteine monomerique soluble ; et a melanger lesdites proteines 
monomeriques solubles dans des conditions permettant la formation de proteines dimehques associees par au 
moins un pont disulfure covalent ; et a isoler a partir du melange une proteine heterodimerique, ladite proteine 
heterodimerique etant un heterodimere BMP-2/5, BMP-2/6, BMP-4/5, BMP-4/6 ou BMP-4/7 humain. 

9. Methode de la revendication 8, dans laquelle ladite cellule hote est E. coli. 

10. Methode de la revendication 8 ou 9, dans laquelle lesdites conditions comprennent le traitement de ladite proteine 
avec un agent solubilisant. 

11. Lignee cellutaire comprenant une sequence nucleotidique codant une premiere BMP ou I'un de ses fragments 
sous le controle d'un systeme de regulation de I'expression approprie et une sequence nucleotidique codant une 
seconde BMP ou I'un de ses fragments sous le controle d'un systeme de regulation de ('expression approprie, 
lesdits systemes de regulation etant capables de diriger la co-expression desdites BMP ou de leurs fragments et 
la formation d'une proteine heterodimerique, ladite proteine heterodimerique recombinee etant un heterodimere 
BMP-2/5, BMP-2/6, BMP-4/5, BMP-4/6 ou BMP-4/7 humain. 
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12. Lignee cellulaire de la revendication 1 1 , dans laquelle lesdites sequences nucleotidiques sont presentes dans une 
seule molecule d'ADN. 

13. Lignee cellulaire de la revendication 11 , dans laquelle lesdites s6quences nucleotidiques sont presentes sur des 
5 molecules d'ADN differentes. 

14. Lignee cellulaire de la revendication 12, dans laquelle ladite unique molecule d'ADN comprend une premiere unite 
de transcription contenant un gene codant une premiere BMP ou Tun de ses fragments et une seconde unite de 
transcription contenant un gene codant une seconde BMP ou I'un de ses fragments 

10 

15. Lignee cellulaire de la revendication 12, dans laquelle ladite unique molecule d'ADN comprend une seule unite 
de transcription contenant plusieurs copies dudit gene codant ladite premiere BMP ou des fragments de celle-ci 
et plusieurs copies dudit gene codant ladite seconde BMP ou des fragments de celle-ci. 

Molecule d'ADN comprenant une sequence nucleotidique codant une premiere BMP selectionnee ou Tun de ses 
fragments et une sequence nucleotidique codant une seconde BMP selectionnee dtfferente ou I'un de ses frag- 
ments, lesdites sequences nucleotidiques etant sous le controle d'au moins une sequence regulatrice appropriee 
capable de diriger la co-expression de chaque BMP ou de leurs fragments, dans laquelle ladite premiere BMP 
selectionnee est BMP-2 ou BMP-4, et ladite seconde BMP selectionnee est BMP-5 ou BMP-6, ou dans laquelle 
ladite premiere BMP selectionnee est BMP-4 et ladite seconde BMP selectionnee est BMP-7. 

17. Molecule de la revendication 16 comprenant une premiere unite de transcription contenant un gene codant une 
premiere BMP ou I'un de ses fragments et une seconde unite de transcription contenant un gene codant une 
seconde BMP ou I'un de ses fragments, 

25 

18. Molecule de la revendication 16, comprenant une seule unite de transcription contenant plusieurs copies dudit 
gene codant ladite premiere BMP ou des fragments de celle-ci et plusieurs copies dudit gene codant ladite seconde 
BMP ou des fragments de celle-ci. 

30 19. Proteine heterodimerique recombinee ayant une activite de stimulation osseuse comprenant une proteine ou un 
fragment d'une premiere BMP en association avec une seconde proteine ou un fragment d'une seconde BMP 
produite par co-expression desdites proteines dans une cellule note selectionnee, ladite proteine heterodimerique 
recombinee etant un heterodimere BMP-2/5, BMP-2/6, BMP-4/5, BMP-4/6 ou BMP-4/7 humain. 

35 20. Composition pharmaceutique comprenant une proteine de la revendication 1 9. 

21 . Utilisation d'une proteine heterodimerique recombinee de la revendication 1 9 pour la production d'une composition 
pharmaceutique pour le traitement de defauts osseux, de maladies desmodontales, la guerison de lesions osseu- 
ses, la cicatrisation de plaies ou I'accroissement de la survie neuronale ou en chirurgie. 

40 
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FIGURE 1A 



10 20 30 40 50 60 70 

GTCGACTCTA GAGTGTGTGT CAGCACTTGG CTOGGGACTT CTTGAACTTG CAGGGAGAAT AACTTGCGCA 



80 90 100 110 120 130 140 

CCCCACTTTG CGCCGGTGCC TTTGCCCCAG CGG AGCCTG C TTCGCCATCT CCGAGCCCCA CCGCCCCTCC 



150 160 170 ISO 190 200 210 

ACTCCTCGGC CTTGCCCGAC ACTGAGACGC TGTTCCCAGC CTGAAAAGAG AGACTGCGCG GCCGGCACCC 



220 230 240 250 260 270 280 

GGGAGAAGGA GGAGGCAAAG AAAAGGAACG GACATTCGGT CCTTGCGCCA GGTCCTTTGA CCAGAGTTTT 



290 300 310 320 330 340 350 

TCCATGTGGA CGCTCTTTCA ATGGACGTGT CCCCGCGTGC TTCTTAGACG GACTGCGGTC TCCTAAAGGT 



<X) 370 385 400 

CGACC ATG GTG GCC GGG ACC CGC TGT CTT CTA GCG TTG CTG CTT CCC CAG GTC 
MET Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gin Val 

415 „ 430 445 

CTC CTG GGC GGC GCG GCT GGC CTC GTT CCG GAG CTG GGC CGC AGG AAG TTC GCG 
Leu Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys Phe Ala 

(24) 

460 475 490 505 

GCG GCG TCG TCG GGC CGC CCC TCA TCC CAG CCC TCT GAC GAG CTC CTG AGC GAG 

Ala Ala Ser Ser Gly Arg Pro Ser Ser Gin Pro Ser Asp Glu val Leu Ser Glu 

520 535 550 565 

TTC GAG TTG CGG CTC CTC AGC ATG TTC GGC CTG AAA CAG AGA CCC ACC CCC AGC 
Phe Glu Leu Arg Leu Leu Ser MET Phe Gly Leu Lys Gin Arg Pro Thr Pro Ser 

580 595 610 

AGG GAC GCC GTG GTG CCC CCC TAC ATG CTA GAC CTG TAT CGC AGG CAC TCA GGT 
Arg Asp Ala Val Val Pro Pro Tyr MET Leu Asp Leu Tyr Arg Arg His Ser Gly 

625 640 655 670 

CAG CCG GGC TCA CCC GCC CCA GAC CAC CGG TTG GAG AGG GCA GCC AGC CGA GCC 

Gin Pro Gly Ser Pro Ala Pro Asp His Arg Leu Glu Arg Ala Ala Ser Arg Ala 
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FIGURE IB 



685 700 7i 5 

AAC ACT GTG CGC AGC TTC CAC CAT GAA GAA TCT TTG GAA GAA CTA CCA GAA ACC 
Asn Thr val Arg Ser Phe His His Glu Glu Ser Leu Glu Glu Leu Pro Glu Th? 

730 745 760 77 5 

ACT GGG AAA ACA ACC CGG AGA TTC TTC TTT AAT TTA AGT TCT ATC CCC Arc p»r 

ser Gly Lys Thr Thr Arg Arg Phe Phe Phe AbS ™ JS Se? £Te ?h? G?u 

790 805 820 Hi* 

Clo SI t?2 Sk C I CA °? A GAG CAG GTT WC CGA GAA CAG ATG CAA GAT GOT 

Glu Phe He Thr Ser Ala Glu Leu Gin Val Phe Arg Glu Gin MET Gin Asp Ala 

850 865 880 

TTA GGA AAC AAT AGC AGT TTC CAT CAC CGA ATT AAT ATT TAT GAA ATC ATA AAA 
Leu Gly Asn Asn Ser Ser Phe His His Arg lie Asn He Tyr Glu lie lie £yt 

895 9io 925 „..„ 

CCT GCA ACA GCC AAC TCG AAA TTC CCC GTG ACC AGA CTT TTG CAC ACC AGG TTG 
Pro Ala Thr Ala Asn a*r Lys Phe Pro Val Thr Arg Leu Leu Asp ££r Arg SS 



955 970 



«r * w 9Q5 

GTG AAT CAG AAT GCA AGC AGG TGG GAA AGT TTT GAT GTC ACC CCC GCT GTG ATG 
Val Asn Gin Asn Ala Ser Arg Trp Glu Ser Phe Asp Val Thr Pro Ala Val MET 

1000 1015 1030 io4S 

? GG l GG £? T G ? A CAG GGA CAC GCC AAC GGA TTC GTG GTG GAA GTG GCC CAC 

Arg Trp Thr Ala Gin Gly His Ala Asn His Gly Phe Val Val Glu Val Ala Sis 

1060 * 1075 1090 n ftt: 

71? ^ G GGT GTC TCC AAG AGA CAT GTT AGG ATA AGC AGG TCT TTG 

Leu Glu Glu Lys Gin Gly val ser Lys Arg His Val Arg He Ser Arg Ser Leu 

(249) 

1120 H35 1150 

CAC CAA GAT GAA CAC AGC TGG TCA CAG ATA AGG CCA TTG CTA CTA ACT TTT GGC 
His Gin Asp Glu His Ser Trp Ser Gin He Arg Pro Leu Leu Val Thr Phe Gly 

(266) 

1165 1180 H95 12 , Q 

CAT GAT GGA AAA GGG CAT CCT CTC CAC AAA AGA GAA AAA CGT CAA GCC AAA CAC 
Hxs Asp Gly Lys Gly His Pro Leu His Lys Arg Glu Lys Arg Gin Ala Lys His 

(283) 

!225 1240 1255 

AAA CAG CGG AAA CGC CTT AAG TCC AGC TGT AAG AGA CAC CCT TTG TAC GTG GAC 
Lys Gin Arg Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp 

(296) 

1270 1285 1300 1315 

TTC AGT GAC GTG GGG TGG AAT GAC TGG ATT GTG GCT CCC CCG GGG TAT CAC GCC 
Phe ser Asp Val Gly Trp Asn Asp Trp lie Val Ala Pro Pro Gly Tyr His Ala 
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FIGURE 1C 



1330 1345 1360 1375 

TTT TAG TGC CAC GGA GAA TGC CCT TTT CCT CTG GCT GAT CAT CTG AAC TCC ACT 
Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr 

1390 1405 1420 

AAT CAT GCC ATT GTT CAG ACG TTG GTC AAC TCT GTT AAC TCT AAG ATT CCT AAG 
Asn His Ala He Val Gin Thr Leu Val Asn Ser Val Asn Ser Lys He Pro Lys. 

1435 1450 1465 1480 

GCA TGC TGT GTC CCG ACA GAA CTC AGT GCT ATC TCG ATG CTG TAC CTT GAC GAG 
Ala Cys Cys Val Pro Thr Glu Leu ser Ala He Ser MET Leu Tyr Leu Asp Glu 

1495 1510 1525 

AAT GAA AAG GTT GTA TTA AAG AAC TAT CAG GAC ATG GTT GTG GAG GGT TGT GGG 
Asn Glu Lys Val Val Leu Lys Asn Tvr G in Asp MET Val Val Glu Glv Cys Gly 

1540(396) 1553 1563 1573 1583 1593 1603 

TGT CGC T AG TAC AG C A AAATTAAATA CATAAATATA TATATATATA TATATTTTAG AAAAAAGAAA 
Cys Arg 



AAAA 
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FIGURE 2A 



10 20 30 40 50 60 70 

CTCTAGAGGG CAGAGGAGGA GGGAGGGAGG GAAGGACCGC GGUGCCCGGC CCGGAAGCTA GGTGAGTGTG 



80 90 100 110 120 130 140 

GCATCCGAGC TGAGGGACGC GAGCCTGAGA CGCCGCTGCT GCTCCGGCTG AGTATCTAGC TTGTCTCCCC 



150 160 170 180 190 200 210 

GATGGGATTC CCGTCCAAGC TATCTCGAGC CTGCAGCGCC ACAGTCCCCG GCCCTCGCCC AGGTTCACTG 



220 230 240 250 260 270 280 

CAACCGTTCA GAGGTCCCCA GGAGCTGCTG CTGGCGAGCC CCCTACTGCA GGGACCTATG GAG CC ATTCC 



290 300 310 320 330 340 350 

GTAGTGCCAT CCCGAGCAAC GCACTGCTGC AGCTTCCCTG AGCCTTTCCA GCAAGTTTGT TCAAGATTGG 



360 370 380 390 400 (1) 

CTGTCAAGAA TCATGGACTG TTATTATATG CCTTGTTTTC TGTCAAGACA CC ATG ATT CCT 

KET He Pro 

417 432 447 462 

GGT AAC CGA ATG CTG ATG GTC GTT TTA TTA TGC CAA GTC CTG CTA GGA GGC GCG 
Gly Asn Arg MET Leu MET Val Val Leu Leu Cys Gin Val Leu Leu Gly Gly Ala 

477 492 507 

AGC CAT GCT AGT TTG ATA CCT GAG ACG GGG AAG AAA AAA GTC G6c GAG ATT CAG 
Ser His Ala Ser Leu lie Pro Glu Thr Gly Lys Lys Lys val Ala Glu lie Gin 

522 537 552 567 

GGC CAC GCG GGA GGA CGC CGC TCA GGG CAG AGC CAT GAG CTC CTG CGG GAC TTC 

Gly His Ala Gly Gly Arg Arg Ser Gly Gin Ser His Glu Leu Leu Arg Asp Phe 

582 597 612 627 

GAG GCG ACA CTT CTG CAG ATG TTT GGG CTG CGC CGC CGC CCG CAG CCT AGC AAG 
Glu Ala Thr Leu Leu Gin KET Phe Gly Leu Arg Arg Arg Pro Gin Pro ser Lys 

642 657 672 

AGT GCC GTC ATT CCG GAC TAC ATG CGG GAT CTT TAC CGG CTT CAG TCT GGG GAG 
Ser Ala Val lie Pro Asp Tyr MET Arg Asp Leu Tyr Arg Leu Gin Ser Gly Glu 
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FIGURE 2B 



687 702 717 732 

GAG GAG GAA GAG CAG ATC CAC AGC ACT GGT CTT GAG TAT CCT GAG CGC CCG GCC 
Glu Glu Glu Glu Gin lie His Ser Thr Gly Leu Clu Tyr Pro Glu Arg Pro Ala 

747 762 777 

AGC CGG GCC AAC ACC GTG AGG AGC TTC CAC CAC GAA GAA CAT CTG GAG AAC ATC 
Ser Arg Ala Asn Thr Val Arg Ser Phe His His Glu Glu His Leu Glu Asn He 

792 807 822 837 

CCA GGG ACC AGT GAA AAC TCT GCT TTT CGT TTC CTC TTT AAC CTC AGC AGC ATC 

Pro Gly Thr Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu Ser Ser He 

852 867 882 897 

CCT GAG AAC GAG GTG ATC TCC TCT GCA GAG CTT CGG CTC TTC CGG GAG CAG GTG 

Pro Glu Asn Glu Val lie Ser Ser Ala Glu Leu Arg Leu Phe Arg Glu Gin val 

912 927 942 

GAC CAG GGC CCT GAT TpG GAA AGG GGC TTC CAC CGT ATA AAC ATT TAT GAG GTT 
Asp Gin Gly Pro Asp Trp Glu Arg Gly Phe His Arg He Asn He Tyr Glu Val 

957 972 987 1002 

ATG AAG CCC CCA GCA GAA GTG GTG CCT GGG CAC CTC ATC ACA CGA CTA CTG GAC 
MET Lys Pro Pro Ala Glu Val Val Pro Gly His Leu He Thr Arg Leu Leu Asp 

1017 1032 1047 

ACG AGA CTG GTC CAC CAC AAT GTG ACA CGG TGG GAA ACT TTT GAT GTG AGC CCT 
Thr Arg Leu Val His His Asn Val Thr Arg Trp Glu Thr Phe Asp Val Ser Pro 

1062 1077 1092 1107 

GCG GTC CTT CGC TGG ACC CGG GAG AAG CAG CCA AAC TAT GGG CTA GCC ATT GAG 

Ala Val Leu Arg Trp Thr Arg Glu Lys Gin Pro Asn Tyr Gly Leu Ala He Glu 

1122 1137 1152 , 1167 

GTG ACT CAC CTC CAT CAG ACT CGG ACC CAC CAG CGC CAG CAT GTC AGG ATT AGC 
Val Thr His Leu His Gin Thr Arg Thr His Gin Gly Gin His val Arg He Ser 

1182 1197 1212 

CGA TCG TTA CCT CAA GGG AGT GGG AAT TGG GCC CAG CTC CGG CCC CTC CTG GTC 
Arg Ser Leu Pro Gin Gly Ser Gly Asn Trp Ala Gin Leu Arg Pro Leu Leu val 

1227 1242 1257 1272 

ACC TTT GGC CAT GAT GGC CGG GGC CAT GCC TTG ACC CGA CGC CGG AGG GCC AAG 
Thr Phe Gly His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg Arg Ala Lys 

1287 1302 1317 

CGT AGC CCT AAG CAT CAC TCA CAG CGG GCC AGG AAG AAC AAT AAG AAC TGC CGG 

Arg Ser Pro Lys His His Ser Gin Arg Ala Arg Lys Lys Asn Lys Asn Cys Arg 
(293) 
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FIGURE 2C 



1332 1347 1362 1377 

CGC CAC TCG CTC TAT GTG GAC TTC AGC GAT GTG GGC TGG AAT GAC TGG ATT GTG 

Arg His Ser' Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp Trp He Val 

1392 1407 1422 ^jL^Z 

GCC CCA CCA GGC TAC CAG GCC TTC TAC TGC CAT GGG GAC TGC CCC TTT CCA CTG 
Ala Pro Pro Gly Tyr Gin Ala Phe Tyr Cys His Gly Asp Cys Pro Pne Pro Leu 

1452 1467 1432 

GCT GAC CAC CTC AAC TCA ACC AAC CAT GCC ATT GTG CAG ACC CTG GTC AAT TCT 
Ala Asp His Leu Asn ser Tbr Asn Kis Ala Ho Val Gin Thr Leu Val Asn Ser 

1497 1512 1527 1542 

GTC AAT TCC AGT ATC CCC AAA GCC TGT TGT GTG CCC ACT GAA CTG AGT GCC ATC 
val Asn ser Ser lie pro Lys Ala Cys eye Val Pro Thr Glu Leu Ser Ala Ha 

1557 1572 1587 

TCC ATG CTG TAC CTG GAT CAG TAT GAT AAG GTG GTA CTG AAA AAT TAT CAG GAG 
Ser MET Leu Tyr Leu f&p Glu Tyr Asp Lys Val val Leu Lys Asp Tyr <?lT> Glu 

1602 1617 (40B) 1636 1646 1656 

ATG GTA GTA GAG GGA TGT GGG TGC CGC TGAGATCAGG CAGTCCTTCA GGATAGACAG 
MET val V al Glu Glv cys Gly Cys Arg 

1666 1676 16B6 1696 1706 1715 1726 

ATATACACAC CACACACACA CACCACATAC ACCACACACA CACGTTCCCA TCCACTCACC CACACACTAC 

1736 1746 1756 1766 1776 1786 1796 

ACAGACTGCT TCCTTATAGC TGGACTTTTA TTTAAAAAAA AAAAAAAAAA AATGGAAAAA ATC C CT AAA C 

1806 1816 1826 1836 1846 1856 1866 

ATTCACCTTG ACCTTATTTA TGACTTTACG TGCAAATGTT TTGACCATAT TG ATC AT AT A TTTTGACAAA 

1876 1886 1896 1906 1916 1926 

ATATATTTAT AACTACGTAT TAAAAGAAAA AAATAAAATG AGTCATTATT TTAAAAAAAA AAAAAAAACT 



1946 

CTAGAGTCGA CGGAATTC 
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FIGURE 3A 



10 20 30 40 50 

GTGkCCGAGC GGCG CGGACG GCCGCCTGCC CCCTCTGCCA CCTGGGGCGG 



60 70 80 90 99 

TGCGGGCCCG GAGCCCGGAG CCCGGGTAGC GCGTAGAGCC GGCGCG ATG 

MET 

(1) 

108 117 126 135 144 

CAC GTG CGC TCA CTG CGA GCT GCG GCG CCG CAC AGC TTC GTG GCG 
His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala 

153 162 171 180 189 

CTC TGG GCA CCC CTG TTC CTG CTG CGC TCC GCC CTG GCC GAC TTC 
Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe 

198 207 216 225 234 

AGC CTG GAC AAC GAG GTG CAC TCG AGC TTC ATC CAC CGG CGC CTC 
Ser Leu Asp Asn Glu Val His Ser Ser Phe lie His Arg Arg Leu 

243 252 261 270 279 

CGC AGC CAG GAG CGG CGG GAG ATG CAG CGC GAG ATC CTC TCC ATT 

Arg Ser Gin Glu Arg Arg Glu MET Gin Arg Glu lie Leu Ser lie 

288 297 306 315 324 

TTG GGC TTG CCC CAC CGC CCG CGC CCG CAC CTC CAG GGC AAG CAC 
Leu Gly Leu Pro His Arg Pro Arg Pro His Leu Gin Gly Lys His 

333 342 351 360 369 

AAC TCG GCA CCC ATG TTC ATG CTG GAC CTG TAC AAC GCC ATG GCG 
Asn Ser Ala Pro MET Phe MET Leu Asp Leu Tyr Asn Ala MET Ala 

378 387 396 405 414 

GTG GAG GAG GGC GGC GGG CCC GGC GGC CAG GGC TTC TCC TAC CCC 

Val Glu Glu Gly Gly Gly Pro Gly Gly Gin Gly Phe Ser Tyr Pro 

423 432 441 450 459 

TAC AAG GCC GTC TTC AGT ACC CAG GGC CCC CCT CTG GCC AGC CTG 
Tyr Lys Ala Val Phe Ser Thr Gin Gly Pro Pro Leu Ala Ser Leu 

468 477 486 495 504 

CAA GAT AGC CAT TTC CTC ACC GAC GCC GAC ATG GTC ATG AGC TTC 
Gin Asp Ser His Phe Leu Thr Asp Ala Asp MET Val MET Ser Phe 

513 522 531 540 549 

GTC AAC CTC GTG GAA CAT GAC AAG GAA TTC TTC CAC CCA CGC TAC 
Val Asn Leu Val Glu His Asp Lys Glu Phe Phe His Pro Arg Tyr 
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FIGURE 3B 



558 567 576 585 594 

CAC CAT CGA GAG TTC CGG TTT GAT CTT TCC AAG ATC CCA GAA GGG 
His His Arg Glu Phe Arg Phe Asp Leu Ser Lys lie Pro Glu Gly 

603 612 621 630 639 

GAA GCT GTC ACG GCA GCC GAA TTC CGG ATC TAC AAG GAC TAC ATC 
Glu Ala Val Thr Ala Ala Glu Phe Arg lie Tyr Lys Asp Tyr lie 

648 657 666 675 684 

CGG GAA CGC TTC GAC AAT GAG ACG TTC CGG ATC AGC GTT TAT CAG 
Arg Glu Arg Phe Asp Asn Glu Thr Phe Arg lie Ser Val Tyr Gin 

693 702 711 720 729 

GTG CTC CAG GAG CAC TTG GGC AGG GAA TCG GAT CTC TTC CTG CTC 
Val Leu Gin Glu His Leu Gly Arg Glu Ser Asp Leu Phe Leu Leu 

738 747 756 765 774 

GAC AGC CGT ACC CTC TGG GCC TCG GAG GAG GGC TGG CTG GTG TTT 
Asp Ser Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu Val Phe 

783 792 801 810 819 

GAC ATC ACA GCC ACC AGC AAC CAC TCG GTG GTC AAT CCG CGG CAC 
Asp lie Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg His 

828 837 846 855 864 

AAC CTG GGC CTG CAG CTC. TCG GTG GAG ACG CTG GAT GGG CAG AGC 
Asn Leu Gly Leu Gin Leu Ser val Glu Thr Leu Asp Gly Gin ser 

873 882 891 900 909 

ATC AAC CCC AAG TTG GCG GGC CTG ATT GGG CGG CAC GGG CCC CAG 
lie Asn Pro Lys Leu Ala Gly Leu He Gly Arg His Gly Pro Gin 

918 927 936 945 954 

AAC AAG CAG CCC TTC ATG GTG GCT TTC TTC AAG GCC ACG GAG GTC 
Asn Lys Gin Pro Phe MET Val Ala Phe Phe Lys Ala Thr Glu Val 

963 972 981 990 999 

CAC TTC CGC AGC ATC CGG TCC ACG GGG AGC AAA CAG CGC AGC CAG 

His Phe Arg Ser He Arg Ser Thr Gly Ser Lys Gin Arg Ser Gin 

(293) 

1008 1017 1026 1035 1044 

AAC CGC TCC AAG ACG CCC AAG AAC CAG GAA GCC CTG CGG ATG GCC 

Asn Arg Ser Lys Thr Pro Lys Asn Gin Glu Ala Leu Arcr MET Ala 

1053 1062 1071 1080 1089 

AAC GTG GCA GAG AAC AGC AGC AGC GAC CAG AGG CAG GCC TGT AAG 
Asn Val Ala Glu Asn Ser Ser Ser Asp Gin Arg Gin Ala Cys Lys 
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FIGURE 3C 



1098 1107 1116 1125 1134 

AAG CAC GAG CTG TAT GTC AGO TTC CGA GAC CTG GGC TGG CAG GAC 
Lys His Glu Leu Tvr Val Ser Phe Arg Asp Leu Gly Trp Gin Asp 

1143 1152 1161 1170 1179 

TGG ATC ATC GCG CCT GAA GGC TAC GCC GCC TAG TAC TGT GAG GGG 
Trp lie lie Ala Pro Glu Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly 

1188 1197 1206 1215 1224 

GAG TGT GCC TTC CCT CTG AAC TCC TAC ATG AAC GCC ACC AAC CAC 
Glu Cys Ala Phe Pro Leu Asn Ser Tyr MET Asn Ala Thr Asn His 

1233 1242 1251 1260 1269 

GCC ATC GTG CAG ACG CTG GTC CAC TTC ATC AAC CCG GAA ACG GTG 
Ala He Val Gin Thr Leu Val His Phe He Asn Pro He Ser Val 

1278 c . 1287 1296 1305 1314 

CCC AAG CCC TGC TGT GCG CCC ACG CAG CTC AAT GCC ATC TCC GTC 
Pro Lys Pro Cys Cys Ala Pro Thr Gin Leu Asn Ala He Ser Val 

1323 1332 1341 1350 1359 

CTC TAC TTC GAT GAC AGC TCC AAC GTC ATC CTG AAG AAA TAC AGA 
Leu Tyr Phe Asp Asp Ser Ser Asn Val He Leu Lys Lys Tyr Arg 

1368 1377 1386 1399 

AAC ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCCTCC 
Asn MET val Val Arg Ala Cys Gly Cys His 

(431) 

1409 1419 1429 1439 1448 

GAGAATTCAG ACCCTTTGGG GCCAAGTTTT TCTGGATCCT CCATTGCTC 
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FIGURE 4A 



10 20 30 40 50 

CGACCATGAG AGATAAGGAC TGAGGGCCAG GAAGGGGAAG CGAGCCCGCC 

60 70 60 90 100 

GAGAGGTGGC GGGGACTGCT CACGCCAAGG GCCACAGCGG CCGCGCTCCG 

110 120 130 140 150 

GCCTCGCTCC GCCGCTCCAC GCCTCGCGGG ATCCGCGGGG GCAGCCCGGC 

159 168 177 186 195 

CGGGCGGGG ATG CCG GGG CTG GGG CGG AGG GCG CAG TGG CTG TGC 

MET Pro Gly Leu Gly Arg Arg Ala Gin Trp Leu Cys 
(1) 

204 213 222 231 240 

TGG TGG TGG GGG CTC CTG TGC AGC TGC TGC GGG CCC CCG CCG CTG 
Trp Trp Trp Gly Leu Leu Cys Ser Cys Cys Gly Pro Pro Pro Leu 

249 258 267 276 285 

CGG CCG CCC TTG CCC GCT CCC GCG GCC GCC GCC GCC GGG GGG CAG 
Arg Pro Pro Leu Pro Ala Ala Ala Ala Ala Ala Ala Gly Gly Gin 

294 303 312 321 330 

CTG CTG GGG GAC GGC GGG AGC CCC GGC CGC ACG GAG CAG CCG CCG 

Leu Leu Gly Asp Gly Gly Ser Pro Gly Arg Thr Glu Gin Pro Pro 

339 348 357 366 375 

CCG TCG CCG CAG TCC TCC TCG GGC TTC CTG TAC CGG CGG CTC AAG 
Pro Ser Pro Gin Ser Ser Ser Gly Phe Leu Tyr Arg Arg Leu Lys 

384 393 402 411 420 

ACG CAG GAG AAG CGG GAG ATG CAG AAG GAG ATC TTG TCG GTG CTG 
Thr Gin Glu Lys Arg Glu MET Gin Lys Glu lie Leu Ser Val Leu 



429 438 447 456 465 

GGG CTC CCG CAC CGG CCC CGG CCC CTG CAC GGC CTC CAA CAG CCG 
Gly Leu Pro His Arg Pro Arg Pro Leu His Gly Leu Gin Gin Pro 
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FIGURE 4B 

501 510 



474 483 „ „^ i-lr GAG CAC CAG CAG CAG CAG 

ss s ss ss ss ss ss ss s; ss «. - «» - 

«7 546 555 

519 528 _ *±L ran. CGA CTG AAG TCC GCG 

CAG CTO IS CGC CGA GAG CCC CCT CCC GCC CCA CT^ ^ ^ ^ 
Gin Leu Pro Arg Gly Giu pro « 

591 600 

- ss i is ss ss ss ss ^ s. - - - 

636 645 

609 616 -~ *rfi CAG CAG TCC TGG CCC 

0 AC GAG GAC GGG GCG TCG GAG GGG GAG AGG CAG C^ ^ pro 

Asp Giu Asp Gly Ala Ser giu 



672 «" S ll 

654 663 - li« rrc CAG CCG CCC CCG. GGC 

- «£ Hi SS SS K S°n S| SI Pro « o» s~ 



726 735 
699 708 IrJ CTT CTG GCC CCC G6A TCT 

OCC GCG c|l CCG «C AAC CGC £G ^ S« S.r I» Wu AXa 

Pro Pro Gly Ala Ala his tro *~ 



771 78 ° 



7 7 A 

744 753 „ m IJc AGC GCG CAG GAC AGC GCC 

SS SS It? S? S IS Pro SS S SS M. «» Asp s.r AXa 



• g^g 825 

789 798 „ !£Z are TTT GTG AAC CTG GTG 

TIC CTC AAC GAC GCG GAC ATG GTC ATG AGC CTT ^ ^ ^ ^ 

Phe Leu Asn Asp Ala Asp MET va* ^ 



861 870 

034 843 53* ^v;* CA c CAC AAA GAG 

CAG TAC GAC AAG GAG TIC TCC CCT °« ^ ^ Ss Hi, LY* 01» 

Giu Tyr Asp Lys Giu Phe Ser Fro m 



QO& 915 
097 906 



r7 q 888 ogT GAG GTG GTG ACG 

TTC AAG SI AAC TTA TCC CAG ATT CCT GAG GGT GA^ ^ ^ ^ 

Phe Lys Phe Asn Leu ser Gin lie *r ^ ^ 

924 9 33 TVT GTT ATG GGG AGT TTT 

CCT GCA SI TTC CGC ATC TAC AAG GAC TGT GTT A^ ^ ^ ^ 
Phe Arg lie Tyr Lys Asp Cys vai nr. 
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FIGURE 4C 



969 978 987 996 1005 

AAA AAC CAA ACT TTT CTT ATC AGC ATT TAT CAA GTC TTA CAG GAG 

Lys Asn Gin Thr Phe Leu lie Ser He Tyr Gin Val Leu Gin Glu 

< 

1014 1023 1032 1041 1050 

CAT CAG CAC AGA GAC TCT GAC CTG TTT TTG TTG GAC ACC CGT GTA 
His Gin His Arg Asp Ser Asp Leu Phe Leu Leu Asp Thr Arg Val 



1059 1068 1077 1086 1095 

GTA TGG GCC TCA GAA GAA GGC TGG CTG GAA TTT GAC ATC ACG GCC 
Val Trp Ala Ser Glu Glu Gly Trp Leu Glu Phe Asp lie Thr Ala 



1104 1113 1122 1131 1140 

ACT AGC AAT CTG TGG GTT GTG ACT CCA CAG CAT AAC ATG GGG CTT 

Thr ser Asn Leu Trp Val Val Thr Pro Gin His Asn MET Gly Leu 



1149 1158 1167 1176 1185 

CAG CTG "AGC GTG GTG AGA AGG GAT GGA GTC CAC GTC CAC CCC CGA 
Gin Leu Ser Val Val Thr Arg Asp Gly Val His Val His Pro Arg 



1194 1203 1212 1221 1230 

GCC GCA GGC CTG GTG GGC AGA GAC GGC CCT TAC GAT AAG CAG CCC 
Ala Ala Gly Leu Val Gly Arg Asp Gly Pro Tyr Asp Lys Gin Pro 



1239 1248 1257 1266 1275 

TTC ATG GTG GCT TTC TTC AAA GTG AGT GAG GTC CAC GTG CGC ACC 

Phe MET Val Ala Phe Phe Lys Val Ser Glu Val His Val* Arg Thr 



1284 1293 1302 1311 1320 

ACC AGG TCA GCC TCC AGC CGG CGC CGA CAA CAG AGT CGT AAT CGC 
Thr Arg ser Ala Ser Ser Arg Arg Arg Gin Gin Ser Arg Asn Arg 

(382) 

1329 1338 1347 1356 1365 

TCT ACC CAG TCC CAG GAC GTG GCG CGG GTC TCC AGT GCT TCA GAT 
Ser Thr Gin Ser Gin Asp Val Ala Ara Val Ser Ser Ala Ser Asp 
(388) 

1374 1383 1392 1401 1410 

TAC AAC AGC AGT GAA TTG AAA ACA GCC TGC AGG AAG CAT GAG CTG 
Tyr Asn ser Ser Glu Leu Lys Thr Ala Cys Arg Lys pis Glu Leu 

(412) 

1419 1428 1437 1446 1455 

TAT GTG AGT TTC CAA GAC CTG GGA TGG CAG GAC TGG ATC ATT GCA 
Tvr Val Ser Phe Gin Asp Leu Gly Trp Gin Asp Trp lie He Ala 
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FIGURE 4D 



1464 1473 1482 1491 1500 

CCC AAG GGC TAT GCT GCC AAT TAC TGT GAT GGA GAA TGC TCC TTC 
Pro Lys Gly Tyr Ala Ala Asn Tyr Cys Asp Gly Glu Cys Ser Phe 



1509 1518 1527 1536 1545 

CCA CTC AAC GCA CAC ATG AAT GCA ACC AAC CAC GCG ATT GTG CAG 
Pro Leu Asn Ala His MET Asn Ala Thr Asn His Ala lie Val Gin 



1554 1563 1572 1581 1590 

ACC TTG GTT CAC CTT ATG AAC CCC GAG TAT GTC CCC AAA CCG TGC 
Thr Leu Val His Leu MET Asn Pro Glu Tyr Val Pro Lys Pro Cys 

1599 1608 1617 1626 1635 

TGT GCG CCA ACT AAG CTA AAT GCC ATC TCG GTT CTT TAC TTT GAT 
Cys Ala Pro Thr Lys Leu Asn Ala lie Ser Val Leu Tyr Phe Asp 



1644 1653 1662 1671 1680 

GAG AAC TCC AAT GTC ATT CTG AAA AAA TAC AGG AAT ATG GTT GTA 
Asp Asn Ser Asn val lie Leu Lys Lys Tyr Arg Asn MET val Val 



1689 1698 1708 1718 1728 

AGA GCT TGT GGA TGC CAC TAACT CG AAA CCAGATGCTG GGGACACACA 

Arg Ala Cys Gly Cys His 

(513) 



1738 


1748 


1758 


1768 


1778 


TTCTGCCTTG 


GATTCCTAGA 


TTACATCTGC 


CTTAAAAAAA 


CACGGAA0CA 


1788 


1798 


1808 


1818 


1828 


CAGTTGGAGG 


TGGGACGATG 


AGACTTTGAA 


ACTATCTCAT 


GCCAGTGCCT 


1838 


1848 


1858 


1868 


1878 


TATTACCCAG 


GAAGATTTTA 


AAGGACCTCA 


TTAATAATTT 


GCTCACTTGG 


1888 


1898 


1908 


1918 


1928 


TAAATGACGT 


GAGTAGTTGT 


TGGTCTGTAG 


CAAGCTGAGT 


TTGGATGTCT 


1938 


1948 


1958 


1968 


1978 


GT AG CAT AAG 


GTCTGGTAAC 


TGCAGAAACA 


TAACCGTGAA 


GCTCTTCCTA 


1988 


1998 


20OB 


2018 


2028 


CCCTCCTCCC 


CCAAAAACCC 


ACCAAAATTA 


GTTTTAGCTG 


TAGATCAAGC 


2038 


2048 


2058 


2068 


2078 


TATTTGGGGT 


GTTTGTTAGT 


AAATAGGGAA 


AATAATCTCA 


AAGGAGTTAA 


2088 


2098 


2108 


2118 


2128 


ATGTATTCTT 


GGCTAAAGGA 


TCAG CTGGTT 


CAGTACTGTC 


TATCAAAGGT 
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FIGURE 4E 



2138 2148 2158 2168 2178 

AGATTTTACA GAGAACAGAA ATCGGGGAAG TGGGG6GAAC GCCTCTGTTC 

2188 2196 2208 2218 2228 

AGTTCATTCC CAGAAGTCCA CAGGACGCAC AGCCCAGGCC ACAGCCAGGG 

2238 2248 2258 2268 2278 

CTCCACGGGG CGCCCTTGTC TCAGTCATTG CTGTTGTATG TTCGTGCTGG 

2288 2298 2308 2318 2328 

AGTTTTGTTG GTGTGAAAAT ACACTTATTT CAGCCAAAAC ATACCATTTC 

2338 2348 2358 2368 2378 

TACACCTCAA TCCTCCATTT GCTGTACTCT TTGCTAGTAC CAAAAGTAGA 



2388 2398 2408 2416 2428 

CTGATTACAC TGAGGTGAGG CTACAAGGGG TGTGTAACCG TGTAACACGT 



2438 2446 2458 2468 2478 

GAAGGCAGTG CTCACCTCTT CTTTACCAGA ACGGTTCTTT GACCAGCACA 

2488 2496 2508 2518 2528 

TTAACTTCTG GACTGCCGGC TCTAGTACCT TTTCAGTAAA GTGGTTCTCT 

2538 2548 2558 2568 2578 

GCCTTTTTAC TATACAGCAT ACCACGCCAC AGGGTTAGAA CCAACGAAGA 

2588 2598 2608 2618 2628 

AAATAAAATG AGGGTGCCCA GCTTATAAGA ATGGT6TTAG GGGGATGAGC 

2638 2648 2658 2668 2678 

ATGCTGTTTA TGAACGGAAA TCATGATTTC CCTGTAGAAA GTGAGGCTCA 

2688 2698 2708 2718 2728 

GATTAAATTT TAGAATATTT TCTAAATGTC TTTTTCACAA TCATGTGACT 



2738 2748 2758 2768 2778 

GGG AAGGGAA TTTCATACTA AACTGATTAA ATAATACATT TATAATCTAC 

2788 2798 2808 2818 2828 

AACTGTTTGC ACTTACAGCT TTTTTTGTAA ATATAAACTA TAATTTATTG 

2838 2848 2858 2868 2878 

TCTATTTTAT ATCTGTTTTG CTGTGGCGTT GGGGGGGGGG CCGGG CTTTT 

2888 2898 2908 2918 

GGGGGGGGGG GTTTGTTTGG GGGGTGTCGT GGTGTGGGCG GGCGG 



86 



EP 0 612 348 B1 



FIGURE 5A 



10 20 30 40 50 

CTGGTATATT TGTGCCTGCT GGAGGTGGAA TTAACAGTAA GAAGGAGAAA 

60 70 80 90 100 

GGGATTGAAT GGACTTACAG GAAGGATTTC AAGTAAATTC AGGGAAACAC 

110 120 130 140 150 

ATTTACTTGA ATAGTACAAC CTAGAGTATT ATTTTACACT AAGACGACAC 

160 170 180 190 200 

AAAAGATGTT AAAGTTATCA CCAAGCTGCC GGACAGATAT ATATTCCAAC 

210 220 230 240 250 

ACCAAGGTGC AGATCAGCAT AGAT CTGTG A TTCAGAAATC AGGATTTGTT 

260 270 280 290 300 

TTGGAAAGAG CTCAJ^GGGTT GAGAAGAACT CAAAAGCAAG TGAAGATTAC 

310 320 330 340 350 

TTTGGGAACT ACAGTTTATC AGAAGATCAA CTTTTGCTAA TTCAAATACC 

360 370 380 390 400 

AAAGGCCTGA TTATCATAAA TTCATATAGG AATGCATAGG TCATCTGATC 

410 420 430 440 450 

AAATAATATT AGCCGTCTTC TGCTACATCA ATGCAGCAAA AACTCTTAAC 

460 470 480 490 500 

AA CTGTGG AT AATTGGAAAT CTGAGTTTCA GCTTTCTTAG AAATAACTAC 

510 520 530 540 550 

TCTTGACATA TTCCAAAATA TTTAAAATAG GACAGGAAAA TCGGTGAGGA 

560 570 580 590 600 

TGTTGTGCTC AGAAATGTCA CTGTCATGAA AAATAGGTAA ATTTGTTTTT 

610 620 630 640 650 

TCAGCTACTG GGAAACTGTA CCTCCTAGAA CCTTAGGTTT TTTTTTTTTT 

660 670 680 690 700 

AAGAGGACAA GAAGGACTAA AAATATCAAC TTTTGCTTTT GGACAAAA 
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FIGURE 5B 



■, A1 7io 719 728 737 

ATG CAT CTG ACT GTA TTT TTA CTT AAG GGT ATT GTG GOT TTC CTC 

m Sis Leu Thr Val Phe Leu Leu Lys Gly He Val Gly Phe Leu 

(1) 

nA* 755 764 773 782 

TGG AGC TGC TGG GTT CTA GTG GGT TAT GCA AAA GGA GGT TTG GGA 

%% ser cys Trp val Leu val Gly Tyr Ala Lys Gly Gly Leu Gly 

ta-i BOO 809 818 827 

GAC AAT CAT GTT CAC TCC AGT TTT ATT TAT A6A AGA CTA CGG AAC 

Asp Asn Sis val His ser Ser Phe He Tyr Arg Arg Leu Arg Asn 

BA5 854 863 872 

CAC GAA AGA CGG 6AA ATA CAA AGG GAA ATT CTC TCT ATC TTG GGT 
Sis Glu Arg Arg Glu He Gin Arg Glu He Leu Ser He Leu Gly 

oo-i 890 899 908 917 

TTG CCT CAC AGA CCC AGA CCA TTT TCA CCT GGA AAA ATG ACC AAT 

ieS Pro His Arg Pro Arg Pro Phe Ser Pro Gly Lys Gin Ala Ser 

p, e 944 953 962 

CAA GCG TCC TCT GCA CCT CTC TTT ATG CTG GAT CTC TAC AAT GCC 

ser Ala Pro Leu Phe MET Leu Asp Leu Tyr Asn Ala MET Thr Asn 

0-71 980 989 998 1007 

GAA GAA AAT CCT GAA GAG TCG GAG TAC TCA GTA AGG GCA TCC TTG 

S5 Glu Asn Pro Glu Glu Ser Glu Tyr Ser Val Arg Ala Ser Leu 

m-»R 1025 1034 1043 1052 

GCA GAA GAG ACC AGA GGG GCA AGA AAG GGA TAC CCA GCC TCT CCC 
Ala Glu Glu Thr Arg Gly Ala Arg Lys Gly Tyr Pro Ala Ser Pro 

1061 1070 1079 1088 1097 

AAT GGG TAT CCT COT CGC ATA CAG TTA TCT CGG ACG ACT CCT CTG 
HI Gly Tyr Pro Arg Arg He Gin Leu Ser Arg Thr Thr Pro Leu 

11ft , 1115 1124 H33 H* 2 

ACC ACC CAG AGT CCT CCT CTA GCC AGC CTC CAT GAT ACC AAC TTT 

tS Gin ser Pro Pro Leu Ala Ser Leu His Asp Thr Asn Phe 

1160 1169 1178 1187 

CTG AAT GAT GCT GAC ATG GTC ATG AGC TTT GTC AAC TTA GTT GAA 
Su Asn Asp Ala Asp MET Val MET Ser Phe Val Asn Leu Val Glu 

n or 1205 1214 1223 1232 

AGA GAC AAG GAT TTT TCT CAC CAG CGA AGG CAT TAC AAA GAA TTT 
A?g Asp lys Asp Phe Ser His Gin Arg Arg His Tyr Lys Glu Phe 
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FIGURE 5C 



1268 ± 2 1Z 



« e ss a as a ss a a s a » a 
*» a as a ss* s ss a as a a 

1358 1367 
1331 "40 -m oul ATC ATC AAG GAA TAC 

a a a a a a sssa ~ oiu ™ 

1412 

1376 1385 „ TTC TTG L TTA GAC ACA AGA AAG GCC 

ACA AAT AGG GAT GCA GAT CTG TTC TTG ^ ^ ^ Arg ^ Ala 
Thr Asn Arg Asp Ala Asp 

1448 14S7 

Is s-as* a ss s a a s a a a a 

1493 1502 
1475 1484 *»#r A&T TTG GGC TTA CAG 

ss a s s a a a a a a a «- 

Tft^R 154*7 
1511 152° ^ 1 52 < 1 rex CGC ACT ATC AAC GTA AAA TCT 

Is s; a a a ss a a s ~ — vsi ws s * r 

,503 1592 
ice* 1565 1574 _ SSr TCA AAA- CAA CCA TTC 

s a a a a ^ a a - a is a ~» «• - 

1628 1637 

*ssi ox.. « i a S c « -sss a a a 

MET Val Ala Phe PHe Xys Ala ser ox 

1 £7 3 1682 

l i si a a a as a a a a s^s 

171B 1727 
1691 I 700 W p TCC AGT GTT GGA GAT TAT AAC 

X JS CAT CAG GAC TCC TCCAGA ATG TCC AGT ^ ^ ^ ^ Asn 

-.1 THr T?- ft°T ? ser Ssr ( j5? j 
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FIGURE 5D 



1736 1745 1754 

ACA A6T GAG CAA AAA CAA GCC 
Thr Ser Glu Gin Lys Gin Ala 



1781 1790 1799 

AGC TTC CGG GAT CTG GGA TGG 
Ser Phe Arg Asp Leu Gly Trp 
(362) 

1826 1835 1844 

GGA TAC GCT GCA TTT TAT TGT 

Gly Tyr Ala Ala Phe Tyr Cye 

1871 1880 1889 

AAC GCC CAT ATG AAT GCC ACC 
Asn Ala His MET Asn Ala Thr 

1916 1925 1934 

GTT CAT CTG ATG TTT CCT GAC 
val His Leu MET Phe Pro Asp 

1961 1970 1979 

CCA ACC AAA TTA AAT GCC ATC 
Pro Thr Lys Leu Asn Ala lie 

2006 2015 2024 

TCC AAT GTC ATT TTG AAA AAA 
ser Asn Val He Leu Lys Lys 



1763 1772 
TGT AAG AAG CAC GAA CTC TAT GTG 
Cys Lys Lys His Glu Leu Tvr Val 
(356) 

1808 1817 
CAG GAC TGG ATT ATA GCA CCA GAA 
Gin Asp Trp He Zle Ala Pro Glu 



1853 1862 
GAT GGA GAA TGT TCT TTT CCA CTT 
Asp Gly Glu Cys Ser Phe Pro Leu 

1898 1907 
AAC CAC GCT ATA GTT CAG ACT CTG 
Asn His Ala Zle Val Gin Thr Leu 

1943 1952 
CAC GTA CCA AAG CCT TGT TGT GCT 
His Val Pro Lys Pro Cys Cys Ala 

1988 1997 
TCT GTT CTG TAC TTT GAT GAC AGC 
Ser Val Leu Tyr Phe Asp Asp ser 

2033 2042 
TAT AGA AAT ATG GTA GTA CGC TCA 
Tyr Arg Asn MET Val Val Arg Ser 



2051 2060 2070 2080 2090 2100 

TGT GGC TGC CAC TAATATTAAA TAATATTGAT AATAACAAAA AGATCTGTAT 
Cys Gly Cys His 
(454) 

2110 2120 2130 2140 2150 

TAAGGTTTAT GGCTGCAATA AAAAGCATAC TTTCAGACAA ACAGAAAAAA AAA 
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Figure 6 

(1) 

GAATTCC GAG CCC CAT TGG AAG GAG TTC CGC TTT GAC CTG ACC CAG ATC CCG GCT 

Glu Pro His Trp Lys Glu Phe Arg Phe Asp Leu Thr Gin lie Pro Ala 

(10) 

GGG GAG GCG GTC ACA GCT GCG GAG TTC CGG ATT TAC AAG CTG CCC AGC ATC CAC 
Gly Glu Ala Val Thr Ala Ala Glu Phe Arg lie Tyr Lys Val Pro Ser lie His 
(20) (30) 

CTG CTC AAC AGG ACC CTC CAC GTC AGC ATG TTC CAG GTG GTC CAG GAG CAG TCC 
Leu Leu Asn Arg Thr Leu His Val ser Met Phe Gin val Val Gin Glu Gin Ser 

(40) (50) 

AAC AGG GAG TCT GAC TTG TTC TTT TTG GAT CTT CAG ACG CTC CGA GCT GGA GAC 
Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp Leu Gin Thr Leu Arg Ala Gly Asp 

(60) (70) 

GAG GGC TGG CTG GTG CTG GAT GTC ACA GCA GCC AGT GAC TGC TGG TTG CTG AAG 

Glu Gly Typ Leu Val Leu Asp Val Thr Ala Ala Ser Asp Cyc Trp Leu Leu Lys 

(BO) 

CGT CAC AAG GAC CTC GGA CTC CGC CTC TAT GTG GAG ACT GAG GAT GGG CAC AGC 

Arg His Lys Asp Leu Gly Lue Arg Leu Tyr val Glu Thr Glu Asp Gly His Ser 
(90) ^ (100) 

GTG GAT CCT GGC CTG GCC GGC CTG CTG GGT CAA CGG GCC CCA CGC TCC CAA CAG 
Val Asp Pro Gly Leu Ala Gly Leu i„eu Gly Gin Arg Ala Pro Arg Ser Gin Gin 
(110) (120) 

CCT TTC GTG GTC ACT TTC TTC AGG GCC ACT CCG AGT CCC ATC CGC ACC CCT CGG 
Pro Phe Val Val Thr Phe Phe Arg Ala Ser Pro Ser Pro lie Arg Thr Pro Arg 

(130) (140) 

GCA GTG AGG CCA CTG AGG AGG AGG CAG CCG AAG AAA AGC AAC GAG CTG CCG CAG 
Ala Val Arg Pro Leu Arg Arg Arg Gin Pro Lys Lys ser Asn Glu Leu Pro Gin 

(150) (160) 

GCC AAC CGA CTC CCA GGG ATC TTT GAT GAC GTC CAC GGC TCC CAC GGC CGG CAG 
Ala Asn Arg Leu Pro Gly lie Phe Asp Asp Val His Gly Ser His Gly Arg Gin 

(170) 

GTC TGC CGT CGG CAC GAG CTC TAC GTC AGC TTC CAG GAC CTT GGC TGG CTG GAC 
Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gin Asp Leu Gly Trp Leu Asp 
(180) (190) 

TGG GTC ATC GCC CCC CAA GGC TAC TCA GCC TAT TAC TGT GAG GGG GAG TGC TCC 
Trp Val He Ala Pro Gin Gly Tyr Ser Ala Tyr Tyr Cys Glu Gly Glu Cys Ser 
(200) (210) 

TTC CCG CTG CAC TCC TGC ATG AAC GCC ACC AAC CAC GCC ATC CTG CAG TCC CTG 
Phe Pro Leu Asp Ser Cys Met Asn Ala Thr Asn His Ala lie Leu Gin Ser Leu 

(220) (230) 
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Figure 6 (Con't) 



GTG CAC CTG ATG AAG CCA AAC GCA GTC CCC AAG GCG TGC TGT GCA CCC ACC AAG 
Val His Leu Met Lys Pro Asn Ala Val Pro Lys Ala Cys Cys Ala Pro Thr Lys 

(240) (250) 

CTG AGC GCC ACC TCT GTG CTC TAG TAT GAC AGC AGC AAC AAC GTC ATC CTG CGC 
Leu Ser Ala Thr Ser Val Leu Tyr Tyr Asp Ser Ser Asn Afin Val lie Leu Arg 

(260) 

AAG CAC CGC AAC ATG GTG GTC AAG GCC TGC GGC TGC CAC TGAGTCAGCCCGCCCAGC 

Lys His Arg Asn Met: Val Val Lys Ala cys Gly cys His 

(270) (280) 
CCTACTGCAGCCACCCTTCTCATCTGGATCGGGCCCTGCAGAGGCAGAAAACCCTTAAATGCTGTCACAG 
CTCAAGCAGGAGTGTCAGGGGCCCTCACTCTCGGTGCCTACTTCCTGTCAGGCTTCTGGGAATTC 
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FXGUR2 7 (cont'd) 
TTACOGCCn TOAOTOACC* OATACeCCTC OCCCCAOCCC AACOACCCAO COCACOOAOT 1980 
CAQTOACCOA GGAAGCOOAA OA0C9CCCAA TAOOCAAACC OCCTCTCCCC OCOOOTTOCC 3040 
COATTCAtTA ATOCAOAATT CATCTCTCAC CTACCAAACA ATOCCCCCCT CC A AAAAATA 2100 
AATTCATATA AAAAAQATAC AGAtAACCA? CTOCOOTOAT AAATTATCTC TOCCOOTOT* 2160 
OACATAAATA OGACT06C0O TOATAMCA0 CACATCAflCA OOACOGACTG ACCACCATOA 2320 
AQGTOA00CT CTTAAAAACT AAOCCCTOAA OAAOOOCAOC ATTCAAAOCA OAAOOCTTTO 2280 
OOOTCTCTOA TACCAAACOA AOCATTOOCC OTAAGTOCOA TTCCOOATTA OCTOOGAATO 2340 
TCCCAATOOC OOOOOCWT? OCTTCAOOAC TACAACTOCC ACACACCACC AAAOCTAACT 2400 
OACAOOAOAA TCCAOATOOA TOCACAAACA COCOOCGOOG AACOTCOOCC AOAQAAACAO 2400 
OCTCAASOOA AACCAOCAAA TCCCCTOTTO OTTOOOOTAA GOCCAAAACC A9TTOOGAAA 2S20 
OATTTTTTTA ACTATAAACO CTOATGOAAO OOtTTATOCC 0AA9AGCTAA AOCCCTTCCC 2 9 BO 
6ACTAACAAA AAAACAACA0 CXTAAATAAC COCOCTCWA CACATTCCAO CCCTOAAAAA 2440 
OGGCATCAAA TTAAACCACA CCIATGOTOT ATCCATTTAT TTOCATACAT TCAATOAATT 3700 
OTTAtCTAAO OAAATACTTA GATATOCAM CTAAACATAA ACAACOTAAA COTCTOAAAT 2760 
CTAOCTOTAA OAOACAOCCT TTOTAOOTGO ACTTCAOTOA CGTGG09TG6 AAMAOTMA 2820 
TTOTGCCTCC CCCOOOOtAT CACOCCTTTT ACVOOCAOGO A0AA1CCCCT TTTCCTCTOO 2860 
CTOATCATCT GAACTCCACT AATCATOCCA TTOTTCAQAC OTTCOTCAAC TCTOTTAACT 2940 
CTAAOATTCC TAAOOCATOC TOTOTOCOGA CAOAACTCAO TOCTATCTCO ATOCTOTACC 3000 
MOAOOAOAA TOAAAAGOTT OTATTAAAGA ACTATCAOOA CATOOTTGTO GAOGCTTOTG 3080 
GOMTOQCTA OTACAOQAAA ATTAAATACA TAAATA7ATA TATATATATA TATTTTAPAA 3120 
AAAAOAAAAA AATCTAOAOT OOACCTOCAG TAATCOTACA OOOTAOTAGA AATAAAAAAO 3180 
OCAOGTCAOA TOACQTOCCT TOTTTCTTOT GAOCAQTAAO CTTOOCACTO OCCOTCOTTT 3240 
TACAACOTCG TOACTOOOAA AACCCTOOOO TTA C CCAA C T TAATCOCCTT OCAOCACATC 3300 
CCCCTTSOOC CAGCTCGCOT AATAOCOAAO AOOCCOQCAC COATC CCCC T TCCCAACAOT 3360 
TCCO C AO C eT oaatooooaa toocodotca tooootattt TCTCCTTAOO GATCTOtCCO 3420 
GTATMCACA CCOCATATAT OOTOOACTCt CAOTACAATC TOCTCTOATO CCOCATAOTT 3460 
AAOCCAOCCO CGAGACCCOC CAACACCCOC T OA CO COCC C TOAOOOOCTT QTCTCCtCCC 3540 
OOCATCCOCT TACAOACAAG CTOTOACCOT CTOCGOCAOC TOCAXOTOTC AOAOOTRTC 3600 
ACCOTCAtCA CCOAAACCCO OOA 3623 
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FIGURE 8 



W-20 ALKALINE PHOSPHATASE: BMP-2 VS. BMP-2/7 
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FIGURE 9 

EFFECTS OF BMP-2 AND BMP2/7 ON BGP SYNTHESIS 
BY W-20 CELLS 
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FIGURE 10 



COMPARAISON OF E.Coli BMP-2 AND BMP-2/7: 
W-20-17 ALKALINE PHOSPHATASE 
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FIGURE 11A 



10 20 30 40 50 60 70 

AGATCITGAA AAGAOO0QGG CCACACAOGC OGaSAOCTAC AGCTCTTTCT CAGOGfTTOGA GIGGAGACGG 



80 90 100 110 120 130 140 

OGOOOGCAGC GOCCTGCGOS GGTGAGGTOC GOGCAGCTGC T3GQGAAGAG CDOUXTTOIC AGGCIGOGCT 



150 160 170 180 190 200 210 

OGGTCAG03C AGCAACTGGG GCIGGOCGCT ATCT0GCIGC AO0OGGO0GC CTOCXX5GGCT OOSTOOGOOC 



220 230 240 250 260 270 280 

TOGCOOCAGC TGCTITQGAG TTCAAOOCIC GGCTOC G CCG O05GC 1 UU 1 T GOGOCTT0GG ACTGTO00GC 



290 300 310 320 (1) 335 

AGOGAOGCOG GGAGOOGAOG O3O0GOGC33G CTAGCEAGCX! ATG GCP GGG GOG AGC AGG CIG CTC 

MET Ala Gly Ala Ser Arg Leu Leu 

350 365 380 395 

TIT CIG TGG CIG GGC TOC TIC TGC GIG AGC CIG GOG CAG GGA GAG AGA COG AAG CCA 
Phe leu Trp Leu Gly Cys Hie Cys Val Ser Leu Ala Girt Gly Glu Arg Pro lys Pro 

410 425 440 455 

CCT TIC COG GAG CTC OSC AAA GCT GIG OCA GGT GAC OGC AOG GCAGGT OCT GGC COG 
Pro Phe Pro Glu Leu Arg lys Ala Val Pro Gly Asp Arg Thr Ala Gly Gly Gly Pro 

470 485 500 515 

GAC TOC GAG CTC CftG COG CAA GAC AAG GIC TCT GAA CAC AUG CTC CGG CTC TAT GAC 
Asp Ser Glu Leu Gin Pro Gin Asp lys Val Ser Glu His MET Leu Arg T«=»n Tyr Asp 

530 545 560 

AGG TAG AGC AOG GIC CAG GOG GCC CGG AGA O0G GGC TOC CTG GAG GGA GGC TOG CAG 
Arg Tyr Ser Thr Val Gin Ala Ala Arg Thr Pro Gly Ser Leu Glu Gly Gly Ser Gin 

575 590 605 620 

OOC TOG CGC OCT CGG CDC CTG OGC GAA GGC AAC AOG GTT GGC AGC ITT COG GCG GCA 

Pro Trp Arg Pro Arg leu leu Arg Glu Gly Asn Thr Val Arg Ser Phe Arg Ala Ala 

635 650 665 680 

GCA GCA GAA ACT CTT GAA AGA AAA GGA CTG TAT ATC TIC AAT CTG ACA TOG CTA ACC 
Ala Ala Glu Thr Leu Glu Arg Lys Gly Leu Tyr lie Phe Asn Leu Ttxr Ser Leu Thr 

695 710 725 740 

AAG TCT GAA AAC ATT TTC TCT GCC ACA CTG TAT TIC TCT ATT GGA GAG CIA GGA AAC 
Lys Ser Glu Asn He Leu Ser Ala Thr leu Tyr Phe cys He Gly Glu Leu Gly Asn 
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FIGURE 11C 



1430 1445 (377) 1460 1475 

TCC GCC AGG AGA TAC CTC AAG GEA GAC TIT GCA GAT ATT QSC TOG AST GAA TOG ATT 
cys Ala Arg Arg Tyr Leu lys Val Asp T*** ftso Tie Glv Tcp Ser Glu Trp Xle 

1490 1505 1520 1535 

ATC TOC OCT? AAG TOC TTT GAT GOC TAT TAT TCC TCP QGA GCA TCC GAG TIC OOC ATC 
Xle Ser Pro lys Ser pie Ago Ala Tvr ivr Cvs ser Glv Ala Cys Gin I%ie Pro MET 

1550' 1565 1580 1595 

OCA AAG TCT TIG AAG OCA TCA AAT CAT GCT AOC ATC CAG ACT ATA GIG AGA GCT GIG 
Pro lys Ser Iai Ia^ Pro Ser Asn His Ala T^r lift Gin Ser Tie Val Arg Ala Val 

1610 1625 1640 1655 

GGG arc GIT OCT GGG AIT OCT GAG OCT TCC TOT OTA OCA GAA AAG ATC TOC TCA CTC 
Gly Val Val Pro Gly He Pro Glu Pro Cys Cys Val Pro Glu lys MET Ser Ser Leu 

1670 X685 1700 

AGT AST TEA TIC TTT GAT GAA AAT AAG AAT CTA GIG CXT AAA GEA TAC OCT AAC ATC 
Ser lie Leu Fhe Rie Asp Glu Asn Lys Asn Val Val lea Lys Val Tvr Pre Asn MET 

1715 1730 (472) 1746 1756 1766 1776 

ACA GEA GAG TCT TCC GCT TCC AGA TAA0CT9GCA AAGAACTGAT TTGAATGCIT AAITCAATCT 
Ttir Val Glu Ser Cvs Ala Cys Arg 

1786 

CEAGAGIOGA CGGAATIC 
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Figure 12 



W-20 ALKALINE PHOSPHATASE CHO BMP-3/6 vs. CHO BMP-2 
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FIGURE 13A 
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FIGURE 13B 
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FIGURE 14A 
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FIGURE 14B 
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